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STREPTOCOCCUS PYOGENES ANTIGENS 



5 FIELD OF THE INVENTION 

The present invention is related to antigens, more particularly 
a polypeptide antigen of Streptococcus pyogenes (also called 
group A Streptococcus (GAS) ) bacterial pathogen which may be 
useful for prophylaxis, diagnostic and/or therapy of 
10 streptococcal infection. 

BACKGROUND OF THE INVENTION 

' Streptococcal are gram ( + ) bacteria which are differentiated by 
group specific carbohydrate antigens A through O which are found 

15 at the cell surface. Streptococcus pyogenes isolates are further 
distinguished by type- specif ic M protein antigens. M proteins 
are important .virulence factors which are highly variable both 
in" molecular weights and in sequences. Indeed, more than 80-M 
protein types have been identified on the basis of antigenic 

20 differences. 

. Streptococcus pyogenes , is responsible, for. many diverse . infection 
types, including pharyngitis, erysipelas and impetigo, scarlet 
fever, "and invasive diseases such as bacteremia and necrotizing 
25 fasciitis and also toxic shock. A resurgence of invasive disease 
in recent years has been documented in many countries, including 
those in North America and Europe. Although the organism is 
sensitive to antibiotics, the high, attack rate and rapid onset 
of sepsis results in high morbidity and mortality. 

30 

To develop a vaccine that will protect individuals from 
Streptococcus pyogenes infection, efforts have concentrated on 
virulence factors such as the type-specific M proteins. However, 
the amino -terminal portion of M proteins was found to induce 
35 cross-reactive antibodies which reacted with human myocardium, 
tropomyosin, myosin, and vimentin, which might be implicated in 
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autoimmune diseases . Others have used recombinant techniques to 
produce complex hybrid proteins containing amino- terminal 
peptides of M proteins from different serotypes. However, a safe 
vaccine containing all Streptococcus pyogenes serotypes will be 
5 highly complex to produce and standardize. 

In addition to the serotype-specif ic antigens, other 
Streptococcus pyogenes proteins have generated interest as 
potential vaccine candidates. The C5a peptidase, which is 

10 expressed by at least Streptococcus pyogenes 40 serotypes, was 
shown to be immunogenic in mice, but its capacity to reduce the 
level of nasopharyngeal colonization was limited. Other 
investigators have also focused on the streptococcal pyrogenic 
exotoxins which appear to play an important role in pathogenesis 

15 of infection. Immunization with these proteins prevented the 
deadly symptoms of toxic shock, but did not prevent 
colonization. 

Therefore there remains an unmet need for Streptococcus pyogenes 
20 antigens that may be used vaccine components for prophylaxis, 
diagnostic and/or therapy of Streptococcus infection. 

SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an' 
25 isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, IS, 2 0 or 

fragments, analogues or derivatives thereof. 

30 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
95% identity to a second polypeptide comprising a sequence 
chosen from SEQ ID NOs: 2, 4, 6,- 8, 10, 12, 14, 16 , 20 or 
fragments, analogues or derivatives thereof. 

35 

In other aspects, there are provided novel polypeptides encoded 

2 
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by polynucleotides of the invention, vectors comprising 
polynucleotides of the • invention operably linked to an 
expression control region, as well as host cells transfected 
with said vectors, pharmaceutical or vaccine compositions and 
5 methods of producing polypeptides comprising culturing said host 
cells under conditions suitable for expression. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 

Figure 1 is the DNA sequence of BVH-P1 gene from serotype 3 S . 
pyogenes strain ATCC12384 with a secretion signal at position 1 
to 75; SEQ ID NO:l. 

15 Figure 2 is the amino acid sequence BVH-P1 protein from serotype 
3 5 . pyogenes strain ATCC12384 with a secretion signal at 
position 1 to 25; SEQ ID NO:2. 

Figure 3 is the DNA sequence of BVH-P1 gene from S. pyogenes 
20 strain LSPQ2699 (ATCC19615) with a secretion signal at position 1 
to 75; SEQ ID NO: 3. 

Figure 4 is the amino acid sequence BVH-P1 protein from 5 . pyogenes 
strain LSPQ2699 (ATCC19615) with a secretion signal at position 1 to 
25 25; SEQ ID NO: 4. 

Figure 5 is the DNA sequence of BVH-P1 gene from S . pyogenes 
strain SPY57 with a secretion signal at position 1 to 75; SEQ ID 
NO: 5. 

30 

Figure 6 is the amino acid sequence BVH-P1 protein from S . pyogenes 
strain SPY57 with a secretion signal at position 1 to 25; SEQ ID 
NO: 6. 
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Figure 7 is the DNA sequence of BVH-P1 gene from S. pyogenes 
strain B514 with a secretion signal at position 1 to 75; SEQ ID 
N0:7. 

5 Figure 8 is the amino acid sequence BVH-P1 protein from S . pyogenes 
strain B514 with a secretion signal at position 1 to 25; SEQ ID 
NO: 8. 

Figure 9 is the DMA sequence BVH-P1 gene without a secretion 
10 signal from serotype 3 S. pyogenes strain ATCC12384 ; SEQ ID 
NO: 9. 

Figure 10 is the amino acid sequence BVH-P1 protein without a 
secretion signal from serotype 3 S .pyogenes strain ATCC123 84 ; 
15 SEQ ID NO: 10, 

Figure 11 is the DNA sequence BVK-P1 gene without a secretion 
signal from serotype 3 S. pyogenes strain LSPQ2699 (ATCC19615) ; 
SEQ ID NO: 11. 

20 

Figure 12 is the amino acid sequence BVH-P1 protein without a 
secretion signal, from serotype 3 S. pyogenes strain LSPQ2699, 
(ATCC19615) ; SEQ ID NO: 12. 

25 Figure 13 is the DNA sequence BVH-P1 gene without a secretion 
signal from serotype 3 S .pyogenes strain SPY57 ; SEQ ID NO: 13. 

Figure 14 is the amino acid sequence BVH-P1 protein without a 
secretion signal from serotype 3 S .pyogenes strain SPY57 ; SEQ 
30 ID N0:14. 

Figure 15 is the DNA sequence BVH-P1 gene without a secretion 
signal from serotype 3 S. pyogenes strain B514 ; SEQ ID NO: 15. 
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Figure 16 is the amino acid sequence BVH-P1 protein without a 
secretion signal from serotype 3 S. pyogenes strain B514 ; SEQ ID 
NO:16. 

5 Figure 17 depicts the comparison of the nucleotide sequences of 
the BVH-P1 genes from ATCC123 84, LSPQ2699 (ATCC19615) , SPY57 , 
B514, ATCC 70029 (Oklahoma) and T28/51/4 (U09352) S. pyogenes 
strains by using the program Clustal W from MacVector sequence 
analysis software (version 6.5). Underneath the alignment, there 
10 is a consensus line. Shaded nucleotides are identical between 

every sequences and gaps in the sequence introduced by alignment 
are indicated by hyphens. 

Figure 18 depicts the comparison of the predicted amino acid 
15 sequences of the BVH-Pl open reading frames from ATCC12384, 
LSPQ2699 (ATCC19615) , SPY57 , B514, ATCC 70029 (Oklahoma) and 
T28/51/4 (U09352) S, pyogenes strains by using the program 
Clustal W from MacVector sequence analysis software 
(version 6.5). Underneath the alignment, there is a consensus 
20 line. Shaded amino acid residues are identical between every 
sequences and gaps in the sequence introduced by alignment are 
. . indicated by .hyphens . ........ 

Figure 19 is the DNA sequence of a gene' from S . pneumonia ; SEQ 
25 ID NO: 17. 

Figure 20 is the amino acid sequence of a protein from S. 
pneumonia ; SEQ ID NO: 18. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising a sequence 
35 chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 2 0 or 
fragments, analogues or derivatives thereof. 

5 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at ' least 
95% identity to a second polypeptide comprising a sequence 
5 chosen from SEQ" ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 20 or 
fragments, analogues or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence comprising 
10 SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 20 or fragments, 
analogues or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide capable of 
15 generating antibodies having binding specificity for a 
polypeptide comprising a sequence chosen from SEQ ID NOs: 2, 4, 
6, 8, 10, 12, 14, 16 , 2 0 or fragments, analogues or derivatives 
thereof . 

20 In accordance with the present invention, there is provided, a 
consensus nucleotide sequence depicted in Figure 17. As can be 
seen . .by... the . alignement , . . the polynucleotide, encoding . the 
polypeptide of the invention is well conserved. Without 
restricting " the scope of the' invention, the following table ■ 1 

25 shows the possible modifications. SEQ ID NO: 19 covers the 
consensus nucleotide sequence depicted in Figure 17 with the 
modifications illustrated in Table 1: 



Position on alignement in 
Figure 17 


Possible nucleotide 


21 


C or T 


53 


C or T 


69 


G or A 


103 


G or C 


149 


C or T 
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150 


A or T 


! 195 


G or A 


244 


T or C 


: 273 


A or C 


282 


T or C 


302 


C or A 


318 


A or G 


334 


G or T 


394 


C or T 


400 


G or A 


415 


C or T 


428-448 


[CTGATGTCCCAACGACACCAT] or 
1 none 


450 


C or A 


473 


C or T 


501 


G or A 


527 


T or C 


572 


T or A 


573 


T or A 


595 


A or C 


596 


C or G 


597 


G or C 


630 ... 


A - or G ... 


632 


A or C 


633 


C or T 


634 


C or T 


665 


A or G 


666 


G or A 


683 


T or C 


7 08 


C or T 


733 


[ CAGATGTT AACT] or none 


798 


T or C 


883 


G or none 


927 


T or A 
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930 


T or C 


943 


T or none 


952 


T or A 


955 


G or A 


964 


T, or C 


973 


G or A 


976 


T or G 


978 


A or T 


979 


A or T 


981 


A or G 


982 


T or C 


986 


' G or A 


988 


T or G 


1033 


G or C 


1034 


C or G 


1102 


C or T 


1143 


A or T 


1144 


A or T 


1 1145 


A or,,T 


1146 


A or T 



In accordance with the present invention, there is provided a 
consensus amino acid sequence depicted in Figure 18. As. can be 
seen by the alignement , the polypeptide of the invention is well 
conserved. Without restricting the scope of the invention, the 
following table 2 shows the possible modifications. SEQ ID NO: 20 
covers the consensus nucleotide sequence depicted in Figure 18 
with the modifications illustrated in Table 2 : 
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Position on alignement in 
Figure 18 , 


Possible amino acid 


18 


A or V 


35 


E or Q 


50 


T or I 


101 


T or N 


112 


A or S 


132 


P or S 


134 


V or I 


139 


S or P 


143 to 149 


SDVPTTP or none 


150 


F or L 


158 


S or F 


176 


L or s 


191 


V or E 


199 


T or P or S 


211 


D or A 


212 


P or S 


222 


E or G 


228 


V or A 


' '242' to 245 


ETSQ or none 


.246. 


E or M 


247 


T or L 


248 


S or T 


295 


A or L 


296 


S or L 


297 


A or P 


298 


F or L 


299 


G or V 


300 


I or L 


301 


T or R 


302 


S or H 


303. 


F or L 
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304 


S or V 


3.05 


G or V 


306 


Y or T 


307 


R or V 


308 


P or Q 


309 


G or E 


310 


D or I 


311 


P or Q 


312 


G. or E 


313 


D or I 


314 


H or I 


326 


E or V 


327 


N or S 


329 


A or T 


344 


E or D 


345 


R or G 


380 


E or V 


381 


N or F 



In accordance with the present invention, all polynucleotides 
encoding polypeptides are' within the scope of ' the present" 
5 invention . 

In a further embodiment, the polypeptides in accordance with the 
present invention are antigenic. 

10 In a further embodiment, the polypeptides in accordance with the. 
present invention are immunogenic. 

In a further embodiment, the polypeptides in accordance with the 
present invention can elicit, an immune response in . an 
15 individual . 

In a further embodiment, the present invention also relates to 
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. polypeptides which are able to raise antibodies having binding 
specificity to the polypeptides of the present invention as 
defined above . 

5 An antibody that "has binding specificity" is an antibody that 
recognizes and binds the selected polypeptide but which does not 
substantially recognize and bind other molecules in a sample, 
e.g., a biological sample. Specific binding can be measured 
using an ELISA assay in which the selected polypeptide is used 
10 as an antigen. 

In accordance with the present invention, "protection" in the 
biological studies is defined by a significant increase in the 
survival curve, rate or period. Statistical analysis using the 

15 Log rank test to compare survival curves, and Fisher ' exact test 
to compare survival rates and numbers of days to death, 
respectively, might be useful to calculate P values and 
determine whether the difference between the two groups is 
statistically significant. P values of 0.05 are regarded as not 

20 significant . 

... As used . herein,. . . "fragments" , . "analogues". . or . "derivatives " . of. 
the polypeptides of the invention include those polypeptides in 
which one or more of the amino acid residues are substituted 

25 with a conserved or non-conserved amino acid residue (preferably 
conserved) and which may be natural or unnatural . In one 
embodiment, derivatives and analogues of polypeptides of the 
invention will have about 70% identity with those sequences 
illustrated in the figures or fragments thereof. That is, 70% 

30 of the residues are the same. In a further embodiment, 
polypeptides will have greater than 75% homology. In a further 
embodiment, polypeptides will have greater than 80% homology. In 
a further embodiment, polypeptides will have greater than 85% 
homology. In a further embodiment, polypeptides will have 

35 greater than 90% homology. In a further embodiment, polypeptides 
will have greater than 95% homology. In a further embodiment, 

11 
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polypeptides will have greater than 99% homology. In a further 
embodiment, derivatives and analogues of polypeptides of the 
invention will have less than about 20 amino acid residue 
substitutions, modifications or deletions and more preferably 
5 less than 10. Preferred substitutions are those known in the 
art as conserved i.e. the substituted residues share physical or 
chemical properties such as hydrophobicity, size, charge or 
functional groups . 

10 The skilled person will appreciate that fragments, analogues or 
derivatives of the proteins or polypeptides of the invention 
will also find use in the context of the present invention, i.e. 
as antigenic/immunogenic material. Thus, for instance proteins 
or polypeptides which include one or more additions, deletions, 

15 substitutions or the like are encompassed by the present 
invention. In addition, it may be possible to replace one amino 
acid with another of similar "type". For instance replacing one 
hydrophobic amino acid with another hydropholic amino acid. 

20 One can use a program such as the CLUSTAL program to compare 
amino acid sequences. This program compares amino acid 
■ sequences and finds the optimal alignment by inserting ' spaces in 
either sequence as appropriate. It is possible to calculate 
amino acid identity or similarity (identity plus conservation of 

25 amino acid type) for an optimal alignment. A program like 
BLASTx will align the longest stretch of similar sequences and 
assign a value to the fit. It is thus possible to obtain a 
comparison where several regions of similarity are found, each 
having a different score. Both types of identity analysis are 

30 contemplated in the present invention. 

In an alternative approach, the analogues or derivatives could 
be fusion proteins, incorporating moieties which render 
purification easier, for example by effectively tagging the 
35 desired protein or polypeptide, it may be necessary to remove 

12 
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the "tag" or it may be the case that the fusion protein itself 
retains sufficient antigenicity to be useful. 

In an additional aspect of the invention there are provided 
5 antigenic/immunogenic fragments of the proteins or polypeptides 
of the invention, or of analogues or derivatives thereof. 

The fragments of the present invention should include one or 
more epitopic regions or be sufficiently similar to such regions 

10 to retain their antigenic/immunogenic properties. Thus, for 
fragments according to the present invention the degree of 
identity is perhaps irrelevant, since they may be 100% identical 
to a particular part of a protein or polypeptide, homologue or 
derivative as described herein. The key issue, once again, is 

15 that the fragment retains the antigenic/immunogenic properties. 

Thus, what is important for analogues, derivatives and fragments 
is that they possess at least a degree of the 
antigenicity/immunogenic of the protein or polypeptide from 
20 which they are derived. 

• Also included • are .polypeptides- which have fused thereto other 
compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) to 
25 increase half-life; leader or secretory amino acid sequences for 
ease of purification; prepro- and pro- sequences; and 
(poly) saccharides . 

Furthermore, in those situations where amino acid regions are 
30 found to be polymorphic, it may be desirable to vary one or more 
particular amino acids to more effectively mimic the different 
epitopes of the different streptococcus strains. 

Moreover, the polypeptides of the present invention can be 
35 modified by terminal -NH 2 acylation (eg. by acetylation, or 

13 
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thioglycolic acid amidation, terminal carbosy amidation, e.g.- 
with ammonia or methylamine) to provide stability, increased 
hydrophobicity for linking or binding to a support or other 
molecule . 

5 

Also contemplated are hetero and homo polypeptide mul timers of 
the polypeptide fragments, analogues and derivatives. These 
polymeric forms include, for example, one or more polypeptides 
that have been cross-linked with cross-linkers such as 
10 avidin/biotin, gluteraldehyde or dimethylsuperimidate . Such 
polymeric forms also include polypeptides containing two or more 
tandem or inverted contiguous sequences, produced from 
multicistronic mRNAs generated by recombinant DNA technology. 

15 Preferably, a fragment, analog or derivative of a polypeptide of 
the invention will comprise at least one antigenic region i.e. 
at least one epitope. 

In order to achieve the formation of antigenic polymers (i.e. 
20 synthetic mult infers) , polypeptides may be utilized having 

bishaloacetyl groups, nitroarylhalides , or the like, where the 
. reagents, being., .specific, for. . thio .groups. . Therefore,, .the. link. 

between two mercapto groups of the different peptides may be a 

single bond or may be composed of a linking group of at least 
25 two, typically at least four, and not more than 16, but usually 

not more than about 14 carbon atoms. 

In a particular embodiment, polypeptide fragments, analogues and 
derivatives of the invention do not contain a methionine (Met) 

30 starting residue. Preferably, polypeptides will not incorporate 
a leader or secretory sequence (signal sequence) . The signal 
portion of a polypeptide of the invention may be determined 
according to established molecular biological techniques. In 
general, the polypeptide of interest may be isolated from a 

35 streptococcal culture and subsequently sequenced to determine 
the initial residue of the mature protein and therefore the 

14 
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sequence of the mature polypeptide. 

According to another aspect, there are provided vaccine 
compositions comprising one or more streptococcal polypeptides 
5 of the invention in admixture with a pharmaceutically acceptable 
carrier diluent or adjuvant. Suitable adjuvants include oils 
i.e. Freund's complete or incomplete adjuvant; salts i.e. 
A1K{S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ) 2 , silica, kaolin, carbon 
polynucleotides i.e. poly IC and poly AU. Preferred adjuvants 
10 include QuilA and Alhydrogel . Vaccines of the invention may be 
administered parenterally by injection, rapid infusion, 
nasopharyngeal absorption, dermoabsorption, or bucal or oral. 
Pharmaceutically acceptable carriers also include tetanus 
toxoid. 

15 

The term vaccine is also meant to include antibodies. In 
accordance with the present invention, there is also provided 
the use of one or more antibodies having binding specificity for 
the polypeptides of the present invention for the treatment or 
20 prophylaxis of streptococcus infection and/or diseases and 
symptoms mediated by streptococcus infection. 

Vaccine compositions of the invention are used for the treatment 
or prophylaxis of streptococcal infection and/or diseases and 

25 symptoms mediated by streptococcal infection As described in 
P.R. Murray (Ed, in chief), E. J.. Baron, M.A. Pfaller, F.C. 
Tenover and R.H. Yolken. Manual of Clinical Microbiology, ASM 
Press, Washington, D.C. sixth edition, 1995, 1482p which are 
herein incorporated by reference. In one embodiment, vaccine 

30 compositions of the present invention are used for the 
prophylaxis or treatment of pharyngitis, erysipelas and 
impetigo, scarlet fever, and invasive diseases such as 
bacteremia and necrotizing fasciitis and also toxic shock. In 
one embodiment, vaccine compositions of the invention are used 

35 for the prophylaxis or treatment of streptococcus infection 
and/or diseases and symptoms mediated by streptococcus 

15 
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infection, in particular group A streptococcus (pyogenes) , group 
B streptococcus (GBS or agalactiae) , S .pneumoniae , dysgalactiae, 
uteris, nocardia as well as Staphylococcus aureus. In a further 
embodiment, the streptococcus ' infection is Streptococcus 
5 pyogenes . 

In a particular embodiment, vaccines are administered to those 
individuals at risk of streptococcus infection such as infants, 
elderly and immunocompromised individuals. 

10 

As used in the present application, the term " individuals 1 ' 
include mammals. In a further embodiment, the mammal is human. 

Vaccine compositions are preferably in unit dosage form of about - 
15 0.001 to 100 fxg/kg (antigen/body weight) and more preferably 
0.01 to 10 ng/kg and most preferably .0 . 1 to 1 |ig/kg 1 to 3 times 
with an interval of about 1 to 6 week intervals between 
immunizations . 

20 Vaccine compositions are preferably in unit dosage form of about 
0.1 [ig to 10 mg and more preferably lug to 1 mg and most 
preferably 10 to 100 (ig 1 to 3 times with an interval of about 1 ' 
to 6. week intervals between immunizations . 

25 According to another aspect, there are provided polynucleotides 
encoding polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8/ 10, 12, 14, 16, 20 or 
fragments, analogues or derivatives thereof. 

30 In one embodiment, polynucleotides are those illustrated in SEQ 
ID NOs: 1, 3, 5, 7, 9, 11 , 13, 15/ 19 which may include the open 
reading frames (ORF) , encoding polypeptides of the invention. 

It will be appreciated that the polynucleotide sequences 
35 illustrated in the figures may be altered with degenerate codons 
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yet still encode the polypeptides of the invention. Accordingly 
the present invention further provides polynucleotides which 
hybridize to the polynucleotide sequences herein above described 
(or the complement sequences thereof) having 50% identity 
5 between sequences. In one embodiment, at least 70% identity 
between sequences. In one embodiment, at least 75% identity 
between sequences. In one embodiment, at least 80% identity 
between sequences. In one embodiment, at least 85% identity 
between sequences. In one embodiment, at least 90% identity 
10 between sequences. In a further embodiment, polynucleotides are 
hybridizable under stringent conditions i.e. having at least 95% 
identity. In a further embodiment, more than 97% identity. 

Suitable stringent conditions for hybridation can be readily 
15 determined by one of skilled in the art (see for example 
Sambrook et al . , (1989) Molecular cloning : A Laboratory Manual, 
2 nd ed,' Cold Spring Harbor, N.Y. ; Current Protocols in Molecular 
Biology, (1999) Edited by Ausubel F.M. et al . , John Wiley & 
Sons, Inc . , N.Y.) . 

20 

In a further embodiment, the present invention provides 

i 

.. . .polynucleotides, that, hybridise under, stringent . . conditions, to... 
either 

(a) a DNA sequence encoding a polypeptide or 
25 (b) the complement of a DNA sequence encoding a polypeptide; 

wherein said polypeptide comprising a sequence chosen from SEQ ID 
NOs: 2, 4, 6, 8, 10, 12, 14/ 16, 20 or fragments or analogues 
thereof . 



30 In a further embodiment, the present invention provides 
polynucleotides that hybridise under stringent conditions to ' 
either 

(a) a DNA sequence encoding a polypeptide or 

(b) the complement of a DNA sequence encoding a polypeptide; 

35 wherein said polypeptide comprises at least 10 contiguous amino 
acid residues from a polypeptide comprising a sequence chosen from 
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SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 20 or fragments or 
analogues thereof . 

In a further embodiment , polynucleotides are those encoding 
5 polypeptides of the invention illustrated in SEQ ID NOs: 2, 4, 
6, 8, 10, 12, 14, 16, 20. 

In a further embodiment, polynucleotides are those illustrated 
in SEQ ID NOs : 1, 3, 5, 7, 9, 11, 13, 15, 19 encoding 
10 polypeptides of the invention. 

As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

15 The present invention also includes polynucleotides 
complementary to the polynucleotides described in the present 
application. 

In a further aspect, polynucleotides encoding polypeptides of 
20 the invention, or fragments, analogues or derivatives thereof, 

may be used in a DNA immunization method. That is, they can be 
• .incorporated . into - a vector , which, is . replicable. and expressible . . 

upon injection thereby producing the antigenic polypeptide in 

vivo. For example polynucleotides may be incorporated into " a 
25 plasmid vector under the control of the CMV promoter which is 

functional in eukaryotic cells. Preferably the vector is 

injected intramuscularly. 

According, to another aspect, there is provided a process for . 

30 producing polypeptides of the invention by recombinant 
techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
polypeptide product. Alternatively, the polypeptides can be 
produced according to established synthetic chemical techniques 

35 i.e. solution phase or solid phase synthesis of oligopeptides 
which are ligated to produce the full polypeptide (block 
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General methods for obtention and evaluation of polynucleotides 
and polypeptides are described in the following references: 
5 Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd- ed, 
Cold Spring Harbor, N.Y., 1989; Current Protocols in Molecular 
Biology, Edited by Ausubel F.M. et al . , John Wiley and Sons, 
Inc. New York; PCR Cloning Protocols, from Molecular Cloning to 
Genetic Engineering, Edited by White B.A., Humana Press, Totowa, 
10 New Jersey, 1997, 490 pages; Protein Purification, Principles 
and Practices, Scopes R.K., Springer- Verlag, New York, 3rd 
Edition, 1993, 380 pages; Current Protocols in Immunology, 
Edited by Coligan J.E. et al . , John Wiley & Sons Inc., New York 
which are herein incorporated by reference. 

15 

For recombinant production, host cells are transfected with 
vectors which encode the polypeptide, and then cultured in a 
nutrient media modified as appropriate for activating promoters, 
selecting transf ormants or amplifying the genes. Suitable 

20 vectors are those that are viable and replicable in the chosen 
-host and include chromosomal, non- chromosomal and synthetic DNA 
sequences e.g.. bacterial, plasmids, - phage .DNA, baculovirus,. yeast . 
plasmids, vectors derived from combinations of plasmids and 
phage DNA. The polypeptide ' sequence may be incorporated in the 
' 25 vector at the appropriate site using restriction enzymes such 
that it is operably linked to an expression control region 
comprising a promoter, ribosome binding site (consensus region 
or Shine-Dalgarno sequence), and "optionally an operator (control 
element) . One can select individual components of the 

30 expression control region that are appropriate for a given host 
and vector according to established molecular biology principles 
(Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd ed, 
Cold Spring Harbor, N.Y., 1989; Current Protocols in Molecular 
Biology, Edited by Ausubel F.M. et al . , John Wiley and Sons, 

35 Inc. New York incorporated herein by reference). Suitable 
promoters include but are not limited to LTR or SV4 0 promoter, 
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E.coli lac, tac or trp promoters and the phage lambda P L 
promoter. Vectors will preferably incorporate an origin of 
replication as well as selection markers i.e. ampicilin 
resistance gene. Suitable bacterial vectors include pET, pQE70, 
5 pQE60, . pQE-9, pbs, pDIO phagescript, psiX174, pbluescript SK, 
pbsks, pNH8A, pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233- 
3, pDR540> pRIT5 and eukaryotic vectors pBlueBacIIl, pWLNEO, 
pSV2CAT, pOG44, pXTl , pSG, pSVK3 , pBPV, pMSG and pSVL. Host 
cells may be bacterial i.e. E.coli , Bacillus subtilis , 
10 Streptomyces ; fungal i.e. Aspergillus niger , Aspergillus 
nidulins ; yeast i.e. Saccharomyces or eukaryotic i.e. CHO, COS. 

Upon expression of the polypeptide in culture, cells are 
typically harvested by centrif ugation then disrupted by physical 

15 or chemical means (if the expressed polypeptide is not secreted 
into the media) and the resulting crude extract retained to 
isolate the polypeptide of interest. Purification of the 
polypeptide from culture media or lysate may be achieved by 
established techniques depending on the properties of the 

20 polypeptide i.e. using ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange 

. -■ ■ . chromatography, . . phosphocellulose . . .chromatography, .... hydrophobic 
interaction chromatography, hydroxylapatite chromatography and 
lect in chromatography. Final purification may be achieved using 

25 HPLC. 

The polypeptide may be expressed with or without a leader or 
secretion sequence. In the former case the leader may be 
removed using post - translational processing (see US 4,431,739; 
30 US 4,425,437; and US 4,338,397 incorporated herein by reference) 
or be chemically removed subsequent to purifying the expressed 
polypeptide . 

According to a further aspect, the streptococcal polypeptides of 
35 the invention may be used in a diagnostic test for streptococcus 
infection, in particular Streptococcus pyogenes infection. 

20 



WO 02/04495 



CA 02413576 2002-12-20 



PCT/CA01/01001 



Several diagnostic methods are possible, for example detecting 
streptococcus organism in a biological sample, the following 
procedure may be followed: 

a) obtaining a biological sample from an individual; 

5 b) incubating an antibody or fragment thereof reactive with a 
streptococcus polypeptide of the invention with the 
biological sample to form a mixture; and 
c) detecting specifically bound antibody or bound fragment in 
the mixture which indicates the presence of streptococcus. 

10 

Alternatively, a method for the detection of antibody specific 
to a streptococcus antigen in a biological sample containing or 
suspected of containing said antibody may be performed as 
follows : 

15 a) obtaining a biological sample from an individual; 

b) incubating one or more streptococcus polypeptides of the 
invention or fragments thereof with the biological sample 
to form a mixture; and 

c) detecting specifically bound antigen or bound fragment in 
20 the mixture which indicates the presence of antibody 

specific to streptococcus. 

One of skill in the art will recognize that this diagnostic test 
may take several forms, including an immunological test such as 
25 an enzyme-linked immunosorbent assay (ELISA) , a radioimmunoassay 
or a latex agglutination assay, essentially to determine whether 
antibodies specific for the protein are present in an 
individual. 

30 The DNA sequences encoding polypeptides of the invention may 
also be used to design DNA probes for use in detecting the 
presence of streptococcus in a biological sample suspected of 
containing such bacteria. The detection method of this 
invention comprises: 

35 a) obtaining the biological sample from an individual; 

b) incubating one or more DNA probes having a DNA sequence 
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encoding a polypeptide of the invention or fragments 
thereof with the biological sample to form a mixture; and 
c) detecting specifically bound DNA probe in the mixture which 
indicates the presence of streptococcus bacteria. 

5 

The DNA probes of this invention may also be used for detecting 
circulating streptococcus i.e. Streptococcus pyogenes nucleic 
acids in a sample, for example using a polymerase chain 
reaction, as a method of diagnosing streptococcus infections. 

10 The probe may be synthesized using conventional techniques and 
may be immobilized on a solid phase, or may be labelled with a 
detectable label. A preferred DNA probe for this application is 
an oligomer having a sequence complementary to at least about 6 
contiguous nucleotides of the Streptococcus pyogenes 

15 polypeptides of the invention. 

Another diagnostic method for the detection of streptococcus in 
an individual comprises: 

a) labelling an antibody reactive with a polypeptide of the 
20 invention or fragment thereof with a detectable label; 

b) administering the labelled antibody or labelled fragment to 
the .patient;, and. .- . 

c) detecting specifically bound labelled antibody or labelled 
fragment in the patient which ' indicates the presence of 

25 streptococcus. 

A further aspect of the invention is the use of the 
streptococcus polypeptides of the invention as immunogens .for 
the production of specific antibodies for the diagnosis and in 

30 particular the treatment of streptococcus infection. Suitable 
antibodies may be determined using appropriate screening 
methods, for example by measuring the ability of a particular 
antibody to passively protect against streptococcus infection in 
a test model. One example of an animal model is the mouse model 

35 described in the examples herein. The antibody may be a whole 
antibody or an antigen-binding fragment thereof and may belong 
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to any immunoglobulin class. The antibody or fragment may be of 
animal origin, specifically of mammalian origin and more 
specifically of murine, rat or human origin. It may be a 
natural antibody or a fragment thereof, or if desired, a 
5 recombinant antibody or antibody fragment. The term recombinant 
antibody or antibody fragment means antibody or antibody 
fragment which was produced using molecular biology techniques. 
The antibody or antibody fragments may be polyclonal, or 
preferably monoclonal. It may be specific for a number of 
10 epitopes associated with the Streptococcus pyogenes polypeptides 
but is preferably specific for one. 

A further aspect of the invention is the use of the antibodies 
directed to the streptococcus polypeptides of the invention for 

15 passive immunization. One could use the antibodies described in 
the present application. Suitable antibodies may be determined 
using appropriate screening methods, for example by measuring 
the ability of a particular antibody to passively protect 
against streptococcus infection in a test model. One example of 

20 an animal model is the mouse model described in the examples 
herein. The antibody may be a whole antibody or an antigen- 
. binding' fragment., thereof .and. may. belong- .to any. immunoglobulin 
class. The antibody or fragment may be of animal origin, 
specif ically of mammalian origin and- more specifically "of 

25 murine, rat or human origin. It may be a natural antibody or a 
fragment thereof, or if desired, a recombinant antibody or 
antibody fragment. The term recombinant antibody or antibody 
fragment means antibody or antibody fragment which was produced 
using molecular biology techniques. The antibody or antibody 

30 fragments may be polyclonal, or preferably monoclonal. It may 
be specific for a number of epitopes associated with the 
streptococcus pneumoniae polypeptides but is preferably specific 
for one . 

35 Unless otherwise defined, all technical and scientific terms 
used herein have the same meaning as commonly understood by one 
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of ordinary . skill in the art to which this invention belongs. 
All publications, patent applications, patents, and other 
references mentioned herein are incorporated by reference in 
their entirety. In case of conflict, the present specification, 
5 including definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not intended to 
be limiting. 

10 

EXAMPLE 1 

This example illustrates the cloning of pyogenes gene. 

The coding region of pyogenes gene BVH-P1 (SEQ ID N0:1) was 

15 amplified by PCR (DNA Thermal Cycler GeneAmp PCR system 2400 
Perkin Elmer, San Jose, CA) from genomic DNA of serotype 3 S. 
pyogenes strain ATCC123 84 using the following oligos that 
contained base extensions for the addition of restriction sites 
Ncol (CCATGG) and Xhol (CTCGAG) : DMAR16 (5»- 

20 CAGG C CATGGAGTGGACACCACGAT CGGTTAC - 3 ' ) ; DMAR1 7 ( 5 » - 

GCCGCTCGAGAGCATTAAAGGAGACATGAACATGATC-3 « ) . PCR products were 
.. purified from .agarose gel. using, a- QIAquick. gel. .extraction . kit., 
from QIAgen following the manufacturer's instructions 
(Chatsworth, CA) , and digested with Ncol and - Xhol (Pharmacia 

25 Canada Inc, Baie d'Urfe, Canada) . The pET-21d(+) vector 
(Novagen, Madison, WI) was digested with Ncol and Xhol and 
purified from agarose gel using a QIAquick gel extraction kit 
from QIAgen (Chatsworth, CA) . The Ncol-Xhol PCR products were 
ligated to the Ncol-Xhol pET- 2 Id (+) expression vector. The 

30 ligated products were transformed into coli strain coli 

strain DH5a [(|)8 0dlacZAM15 A (JacZYA-argrF) U169 endAl recAl 
hsdR17 (r K -m K +) deoR thi-1 supE44 X~gyrA96 relAl] (Gibco BRL, 
Gaithersburg, MD) according to the method of Simanis (Hanahan, 
D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109-135). 

35 Recombinant pET-21d(+) plasmid (rpET21d (+) ) containing BVH-P1 
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gene .was purified using a QIAgen plasmid kit (Chatsworth, CA) 
and DNA insert was sequenced (Taq Dye Deoxy Terminator Cycle 
Sequencing kit, ABI , Foster City, CA) . 

i 

5 It was determined that the open reading frame (ORF) which codes 
for BVH-P1 contains 1170 -bp and encodes a 389 amino acid 
residues polypeptide with a predicted pi of 4.37 and a predicted 
molecular mass of 41054 Da. 

Analysis of the predicted amino acid residues sequence (SEQ ID 
10 NO: 2) using the Spscan sofware (Wisconsin Sequence Analysis 
Package; Genetics Computer Group) suggested the existence of a 
2 5 amino acid residues signal peptide 

( MI ITKKSLFVTS VALS LAPLATAQA ) , which ends with a cleavage site 
situated between an alanine and a glutamine residues. Analysis 
15 of this ORF did not revealed the presence of repetitive 
structures, cell wall anchoring motif (LPXTG) , or IgA binding 
motif (MLKKIE) . 

An ORF which shares 62% with the pyogenes BVH-P1 gene was 

20 initially presented in the patent application PCT/CA99/00114 

which described Group B streptococcus antigens. BVH-P1 gene was 
.- • . also . found tp .share, .homology (.62%. identity) .with. an. ORF present 

in . the genome of S^ pneumoniae (The Institute for Genomic 

Research) . 

25 

EXAMPLE 2 

This example describes the PCR amplification and sequencing of 
BVH-P1 gene from other's^ pyogenes strains and the evaluation of 
the level of molecular conservation of this gene. 

30 

Lancefield's 'serogroup A pyogenes LSPQ2296 (ATCC 19615) was 
provided by the laboratoire de la sante publique du Quebec, 
Sainte-Anne-de-Bellevue ; serotype 1 pyogenes SPY57 clinical 

isolate was provided by the centre de recherche . en inf ectiologie 
35 du centre hospitalier de 1' university Laval, Sainte-Foy; and S . 
pyogenes strain B514 which was initially isolated from a mouse 
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was provided by Susan Hollingshead, from University of Alabama, 
Birmingham. The respective coding region of pyogenes gene 

BVH-P1 from strains ATCC 12384 (SEQ ID NO:l) f 
LSPQ2699 (ATCC19615) (SEQ ID NO:3), SPY57 (SEQ ID NO:-5) , and B514 
5 (SEQ ID NO: 7) were amplified by PCR (DNA Thermal Cycler GeneAmp 
PCR system 2400 Perkin Elmer, San Jose, CA) from bacterial cell 
lysates using the following oligos DMAR69 (5'- 
CTGGGAAGATTATCTAGCACATTAATAC-3 1 ) ; DMAR72 (5 1 - 

CATAACGTTAAAACTGTCTAAAGGG-3 ' ) . PCR products were purified from 

10 agarose gel using a QIAquick gel extraction kit from QIAgen 
following the manufacturer's instructions (Chatsworth, CA) and 
the DNA insert were sequenced (Taq Dye Deoxy Terminator Cycle 
Sequencing kit, ABI , Foster City, CA) , The predicted amino acid 
sequences from strains ATCC12384 (SEQ ID N0:2), 

15 LSPQ2699 (ATCC19615) (SEQ ID NO : 4 ) , SPY57 (SEQ ID NO : 6 ) , and B514 
(SEQ ID NO: 8) were respectively presented in the following 
figures 2, 4, 6, and 8. 

The figures 17 and 18 respectively depict the consensus 
20 nucleotide and predicted amino acid sequences established for S . 
pyogenes BVH-P1. In addition to the sequences presented 
.... herewith,, the BVH-P1. gene sequences . from. the. genome, sequencing 
project at the University of Oklahoma (serotype Ml S^ pyogenes 
strain ATCC ' 70029 : http://dnal-chem.6u.edu/strep.html ' ) and 
25 from (Kil et al . 1994. Infect. Immun. 62 : 2440-2449 : GenBank 
accession number U09352) were also included. No function or role 
in the pathogenesis of the bacteria or protection against 
infection was described by Kil et al . for the sequence with 
GenBank accession number U09352. This latter sequence presented 
30 by Kil et al . was shown to be located upstream of a S. pyogenes 
67kDa myosin-cross-reactive antigen. 

Pairwise comparison of the BVH-P1 predicted protein sequences 
revealed between 95 to. 100% identity with the exception of the 
35 BVH-P1 sequence obtained from GenBank under the accesssion 
number U09352. Pairwise comparison of that particular sequence 
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with the other five BVH-P1 sequences indicated identity between 
87 to 91%. This lower homology can be explained by the presence 
of two regions (119-124 and 262-281) which are more divergent 
comparatively to the other BVH-P1 gene sequences. Beside these 
5 two regions in the BVH-P1 sequence obtained from GenBank under 
the accesssion number U09352, the BVH-P1 genes showed great 
similarity in overall organization. 

10 EXAMPLE 3 

This example illustrates the cloning of pyogenes protein gene 
in CMV plasmid pCMV-GH. 

The DNA coding region of a pyogenes protein was inserted in 
15 phase downstream of a human growth hormone (hGH) gene which was 
under the transcriptional control of the cytomegalovirus (CMV) 
promotor in the plasmid vector pCMV-GH (Tang et al., Nature, 
1992, 356 :152). The CMV promotor is a non functional plasmid 
in coli cells but is active upon administration of the 

20 plasmid in eukaryotic cells. The vector also incorporated the 
ampicillin resistance gene. 

The coding region of BVH-P1 gene (SEQ ID NO: 9) without its 
leader peptide region was amplified by PCR (DNA Thermal Cycler 

25 GeneAmp PCR system 2400 Perkin Elmer, San Jose, CA) from genomic 
DNA of serotype 3 pyogenes strain ATCC123 84 using the 

following oligos that contained base extensions for the addition 
of restriction .sites BamHI (GGATCC) and Sail (GTCGAC) : DMAR24 
( 5 ' - TACCCGGATCCCCAAGAGTGGAC ACCACGATCGG - 3 ■ ) ; DMAR2 5 ( 5 ' - 

30 GCGCTCGTCGACGCGTATCTCAGCCTCTTATAGGGC-3 r ) . The PCR product was 
purified from agarose gel using a QIAquick gel extraction kit 
from QIAgen (Chatsworth, CA) , digested with restriction enzymes 
(Pharmacia Canada Inc, Bale d'Urfe, Canada) . The pCMV-GH vector 
(Laboratory of Dr. Stephen A. Johnston, Department of 

35 Biochemistry, The University of Texas, Dallas, Texas) was 
digested with BamHI and Sail and purified from agarose gel using 
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the QJAquick gel extraction kit from QIAgen (Chatsworth, CA) . 
The BamHI-Sall DNA fragments were ligated to the BamHl-Sall 
pCMV-GH vector to create the hGH-BVH-Pl fusion protein under the 
control of the CMV promoter. The ligated products were 
5 transformed into E_;_ coli strain DH5a [<j)80d!acZAM15 A(lacZYA- 
argF)U169 endAl recAl hsdRll (r K -m K +) deoR thi-1 supE44 X~gyrA96 
relAl] (Gibco BRL, Gaithersburg, MD) according to the method of 
Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 
109-135) . The recombinant pCMV plasmid was purified using a 
10 QIAgen plasmid kit (Chatsworth, CA) and the nucleotide sequence' 
of the DNA insert was verified by DNA sequencing. 

EXAMPLE 4 

15 This example illustrates the use of DNA to elicit an immune 
response to pyogenes antigens. 

A group of 8 female BALB/c • mice (Charles River, St-Constant, 
Quebec, Canada) were immunized by intramuscular injection of 100 

20 fil three times at two- or three-week intervals with 50 /xg of 
recombinant pCMV-GH encoding BVH-P1 gene in presence of 50 fxg of 
• granulocyte -macrophage- ' colony- stimulating • 'factor' ' (GM-CSF)--- 
expressing plasmid pCMV-GH-GM-CSF (Laboratory of Dr. Stephen A. 
Johnston, Department of Biochemistry, The University of Texas, 

25 Dallas, Texas) . As control, a group of mice were injected with 
5 0 /xg of pCMV-GH in presence of 50 /xg of pCMV-GH -GM-CSF . Blood 
samples were collected from the orbital sinus prior to each 
immunization and seven days following the third injection and 
serum antibody responses were determined by ELISA using purified 

30 BVH-Pl-His»Tag from SEQ ID NO: 11 pyogenes recombinant protein 
as coating antigen. 
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EXAMPLE 5 

This example illustrates the production and purification of 
recombinant pyogenes BVH-P1 protein. 

5 The recombinant pET-21d (+) plasmid with BVH-P1 gene corresponding 
to the SEQ ID NO: 9 was used to transform by electroporation 
(Gene Pulser II apparatus, BIO-RAD Labs, Mississauga, Canada) E . 
coli strain BL21(DE3) (F'ompT hsdS B (r* B m" B ) gal dcm (DE3) ) 
(Novagen, Madison, WI) . In this strain of Ej_ coli , the T7 

10 . promoter controlling expression of the recombinant protein is 
specifically recognized by the T7 RNA polymerase (present on the 
XDE3 prophage) whose gene is under the control of the lac 
promotor which is inducible by isopropyl-£-d-thio- 
galactopyranoside (IPTG) . The transformant BL21 (DE3) /rpET was 

15 grown at 37°C with agitation at 250 rpm in LB broth (peptone 
lOg/L, yeast extract 5g/L, NaCl lOg/L) containing 100 fig of 
carbenicillin ( Sigma- Aldrich Canada Ltd. , Oakville, Canada) per 
ml until the A 60 o reached a value of 0.6. In order to induce the 
production of pyogenes BVH-Pl-His«Tag recombinant protein 

20 (from SEQ -ID NO: 10), the cells were incubated for 3 additional 
hours in the presence of IPTG at a final concentration of 1 mM. 
' '-Induced ' ■ cells ■. ■ from ' a 500 ■ ml " culture were pelleted'' ' by 
cent r if ugation and frozen at -7 0°C. 

25 The purification of the recombinant, proteins from the soluble 
cytoplasmic fraction of IPTG-induced BL21 (DE3) /rpET21b{+) was 
done by affinity chromatography based on the properties of the 
His*Tag sequence (6 consecutive histidine residues) to bind to 
divalent cations (Ni 2+ ) immobilized on the His»Bind metal 

30 chelation resin. Briefly, the pelleted cells obtained from a 
5 00 mL culture induced with IPTG was resuspended in lysis buffer 
(20 mM Tris, 500 mM NaCl, 10 mM imidazole, pH 7.9) containing 
ImM PMSF, sonicated and centrifuged at 12,000 X g for 2 0 min to 
remove debris. The supernatant was deposited on a Ni-NTA 

35 agarose column (Qiagen, Mississauga, Ontario, Canada) . The 
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pyogenes BVH-Pl-His»Tag recombinant protein (from SEQ ID MO: 10) 
was eluted with 250 mM imidazole- 500mM NaCl-20 mM Tris pH 7.9. 
The removal of the salt and imidazole from the sample was done 
by dialysis against PBS at 4°C. The quantities of recombinant 
5 protein obtained from the soluble fraction of coli was 

estimated by MicroBCA (Pierce, Rockford, Illinois) . 

EXAMPLE 6 

10 This example illustrates the accessibility to antibodies of the 
BVH-P1 protein at the surface of pyogenes strain. 

Bacteria were grown in Tood Hewitt (TH) broth (Difco 
Laboratories, Detroit MI) with 0.5% Yeast extract (Difco 

15 Laboratories) and' 0.5% peptone extract (Merck, Darmstadt, 
Germany) at 37°C in a 8% C0 2 atmosphere to give an OD 49 onni of 0.600 
(~10 8 CFU/ml) . Dilutions of anti-BVH-Pl or control sera were then 
added and allowed to bind to the cells, which were incubated for 
2 h at 4°C. Samples were washed 4 times in blocking buffer 

20 [phosphate-buffered saline (PBS) containing 2% bovine serum 
albumin (BSA) ] , and then 1 ml of goat fluorescein (FITC) - 
conjugated anti-mouse IgG + IgM diluted in blocking buffer was 
.... added. After an . additional . incubation • of . 6.0 min. . at . room 
temperature, samples were washed 4 times in blocking buffer and 

25 fixed with 0.25 % formaldehyde in PBS buffer for 18-24 h at 4°C. 
Cells were washed 2 times in PBS buffer and resuspended in 500 
/il of PBS buffer. Cells were kept in the dark at 4°C until 
analyzed by flow cytometry (Epics® XL; Beckman Coulter, Inc.). 
Flow cytometric analysis revealed that BVH-P1 -specific 

30 antibodies efficiently recognized their corresponding surface 
exposed epitopes on both the homologous (ATCC12384; serotype3) 
and the heterologous (SPY57 / seotype 1) pyogenes strains 

tested. It was determined that more than 90 % of the 10, 000 S . 
pyogenes cells analyzed were labeled with the antobodies present 

35 in the BVH-MC1 specific anti-sera. These observations clearly 

30 
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demonstrate that the BVH-P1 protein is accessible at the surface 
where it can be easily recognized by antibodies. Anti- S. 
pyogenes antibodies were shown to play an important role in the 
protection against pyogenes infection. 

5 

EXAMPLE 7 

This example illustrates the protection against fatal S. 
pyogenes infection induced by passive immunization of mice with 
10 rabbit hyper- immune sera. 

New Zealand rabbits (Charles River laboratories, Montreal, 
Canada) were injected subcutaneously at multiple sites with 
approximately 50 jig and 100 \ig of BVH-Pl-His«Tag protein (from 

15 SEQ ID NO: 10) that was produced and purified as described in 
Example 5 and adsorbed to Alhydrogel adjuvant (Superfos 
Biosector a/s) . Rabbits were immunized three times at three- 
week intervals with the BVH-Pl -His»Tag protein (from SEQ ID 
NO: 10). Blood samples were collected three weeks after the 

20 third injection. The antibodies present in the serum were 
purified by precipitation using 40% saturated ammonium sulfate. 
■Groups of 10' female' CD-T mice (Charles' River) "were injected' 
intravenously with 500 fxl of purified serum collected either 
from BVH-Pl-His»Tag (from SEQ ID NO:10) immunized rabbits or 

25 rabbits immunized with an unrelated control recombinant protein. 
Eighteen hours later the mice were challenged with approximately 
2xl0 7 CFU of the type 3 pyogenes strain ATCC12384. Samples of 
the S^ pyogenes challenge inoculum were plated on blood agar 
plates to determine the CFU and to verify the challenge dose. 

30 Deaths were recorded for a period of 5 days . 

EXAMPLE 8 

This example illustrates the protection of mice against fatal S^ 
35 pyogenes infection induced by immunization with BVH-P1 protein. 

31 
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Groups of 8 female CD-I mice (Charles River) were immunized 
subcutaneously three times at three-week intervals with 20 /ig of 
affinity purified pyogenes BVH-Pl-His*Tag recombinant protein 
5 (from SEQ ID NO:10) in presence of 10 /ig of QuilA adjuvant 
(Cedarlane Laboratories Ltd, Hornby, Canada), or, as control, 
with QuilA adjuvant alone in PBS. Blood samples were collected 
from the orbital sinus on day 1, 22 and 43 prior to each 
immunization and seven days (day 50) following the third 

10 injection. Analysis by ELISA using purified recombinant BVH-P1 
protein (from SEQ ID NO: 10) clearly indicated that this protein 
is highly immunogenic in animals. Indeed reciprocal ELISA titers 
higher than 10 6 were determined for the mice immunized with this 
recombinant protein. Two weeks later the mice were challenged 

15 with approximately 2x1 0 7 CFU of the type 3 S^ pyogenes strain 
ATCC123 84. Samples of the S^ pyogenes challenge inoculum were 
plated on' blood agar plates to determine the CFU and to verify 
the challenge dose. Deaths were recorded for a period of 5 
days . Five out of the 8 (62%) mice immunized with three 

20 injections of 2 0 jig of purified recombinant BVH-P1 (from SEQ ID 
NO: 10) and QuilA adjuvant survived the bacterial challenge to 
• .only 2/7 (28%) in the control group. 

Table 3. Immunization of CD-I mice with purified recombinant 
25 BVH-P1 protein confers protection against subsequent challenge 
with S^ pyogenes strain ATCC 12384 



Groups 


Survival of the mice challenged \ 
pyogenes strain ATCC 12384 (Day 
challenge: number of survivors/ 
number of mice challenged) 


tfith S . 
after 
total- 




1 


2 


3 


4 


5 


2 0 }xg of 

BVH-P1- 

His*Tag 


8/8 


8/8 


7/8 


6/8 


5/8 


Control 


7/7 


6/7 


3/7 


2/7 . 


2/7 
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SEQUENCE LISTING 

<110> SHIRE BIOCHEM INC. 
MARTIN, Denis 
HAKEL, Josee 
BRODEUR , Bernard 

<12 0> STREPTOCOCCUS PYOGENES ANTIGENS 



-<130> 12806-20PCT 



<150> US 60/216,465 
<151> 2000-07-06 



<160> 29 



<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1170 
<212> DMA 
<213> S. pyogenes 



<400> 1 

atgattatta ctaaaaagag cttatttgtg acaagtgtcg ctttgtcgtt agcacctttg 60 

gcgacagcac aggcacaaga gtggacacca cgatcggtta cagaaatcaa gtctgaactc 120 

gtcctagttg ataatgtttt tacttatact gtaaaatacg gtgacacttt aagcacaatt 180 

gctgaagcaa tgggaattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 240 

gacttaattt ttccagacac gatcctaaca gccaactaca accaacacgg tcaggcaacg 3 00 

actttgacgg ttcaagcgcc tgcttctagt ccagctagcg ttagtcatgt acctagcagt 360 

gagccattac cccaagcatc tgccacctct caatcgactg ttcctatggc accatctgcg 420 

acaccatctg atgtcccaac gacaccattc gcatctgcaa agccagatag ttctgtgaca 480 

gcgtcatctg agctcacatc gtcaacgaat gatgtttcga ctgagttgtc tagcgaatca 540 

caaaagcagc cagaagtacc acaagaagca gttccaactc ctaaagcagc tgaaacgact 600 

gaagtcgaac ctaagacaga catctcagag gattcaactt cagctaatag gcctgtacct 660 

aacgagagtg ettcagaaga agtttcttct gcggccccag cacaagcccc agcagaaaaa . ' 720 

gaagaaacct ctgcgccagc agcacaaaaa gctgtagctg acaccacaag tgttgcaacc 780 

tcaaatggcc tttcttacgc tccaaaccat gcctacaatc caatgaatgc agggcttcaa , 840 

ccacaaacag cagccttcaa agaagaagtg gcttctgcct ttggtattac gtcatttagt ■• 900 

ggttaccgtc caggtgatcc aggagatcat ggtaaaggtt tggccattga ttttatggtg 960 

cctgaaaatt ctgctcttgg tgatcaagtt gctcaatatg ccattgacca tatggcagag 1020 

cgtggtattt catacgttat ttggaaacag cgattctatg cgccatttgc aagtatttac 1080 

ggaccagcct acacatggaa ccccatgcca gatcgcggca gtattacaga aaaccattat 1140 

gatcatgttc atgtctcctt taatgcttaa 1170 



<210> 2 

<211> 389 

<212> PRT 

<213> S. pyogenes 



<400> 2 



Met 


He 


He 


Thr 


Lys 


Lys 


Ser Leu Phe 


Val 


Thr 


Ser 


Val 


Ala 


Leu 


Ser 


1 








5 






10 










15 




Leu 


Ala 


Pro 


Leu 


Ala 


Thr 


Ala Gin Ala 


Gin 


Glu 


Trp 


Thr 


Pro Arg 


Ser 








20 






25 










30 






Val 


Thr 


Glu 


He 


Lys 


Ser 


Glu Leu Val 


Leu 


Val 


Asp Asn 


Val 


Phe 


Thr 






35 








40 








45 








Tyr 


Thr 


Val 


Lys 


Tyr 


Gly Asp Thr Leu 


Ser 


Thr 


He 


Ala 


GlU 


Ala 


Met 




50 










55 






60 










Gly He Asp Val His 


Val 


Leu Gly Asp 


He 


Asn 


His 


He 


Ala 


Asn 


He 


65 










70 






75 










80 
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Asp 


Leu 


xxtr 


Phe 




Asp 


Thr 












85 










uill 


Al 

nla 


11 IX 


Thr 


Leu 


1111 


vai. 








i nn 

XVv 










Ser 


val 


Ser 




vai 


Pro 


Ser 


Ser 






115 










120 


inr 


Ser 




Ser 


Thr 


vai 


Pro 


i v iet 




ion 










-IOC 

1 J D 




vai 


Pro 


Thr 


Thr 


Pro 


Phe 


Til - 

Ala 


Ser 


1 /I c 
1*1 3 










13 U 






Ala 


Ser 


Ser 


Glu 


Leu 


inr 


Ser 












IDD 








Ser 


Ser 


Glu 


Ser 


bin 


Lys 


bin 


Pro 








180 










Thr 


Pro 


Lys 


Ala 


Ala 


Glu 


Thr 


1111 






195 












Ser 


Glu 


Asp 


Ser 


Thr 


Ser 


Ala 


Asn 




210 










215 




Ser 


Glu 


Glu Val 


Ser 


Ser 


Ala 


Ala 












230 






Glu 


Glu 


Thr 


Ser 


Ala 


Pro 


Ala 


Ala 










245 








Ser 


Val 


Ala 


Thr 


Ser Asn Gly 


Leu 








260 










Asn 


Pro 


Met 


Asn 


Ala Gly Leu 


Gin 






275 










Z o U 


Glu 


Val 


Ala 


Ser 


Ala 


Phe 


Gly 


lie 




^ y o 










295 




Gly 


Asp 


Pro Gly 


Asp 


His 


Gly 


Lys 


305 










310 






Pro 


Glu 


Asn 


Ser 


Ala Leu Gly 


Asp 










325 








His 


Met 


Ala 


Glu 


Arg 


Gly 


lie 


Ser 








340 










Tyr 


Ala 


Pro 


Phe 


Ala 


Ser 


lie 


Tyr 






355 










360 


Met 


Pro 


Asp 


Arg 


Gly Ser lie 


Thr 




37 0 










375' 




Val 


Ser 


Phe 


Asn 


Ala 









385 



Leu 


ilii 


Ala 


Asn 


Tyr 


Asn 


m n Hi a 

vjxii nio 




q n 












ml n 


7\ "1 a 
ril d. 


Pro 


Hid 


Ser 


Ser 


Pro Ala 


105 










110 




vjlU 


Pro 


Leu 


Pro 


Gin Ala 


ocx Hid 










125 






Ala 


Pro 


Ser 


7\ 1 a 

Mia 


Thr 


Pro 


Ser Asp 








l*t U 








Hid 


Lys 


Pro 


Asp 


Ser 


Ser 


val Thr 






13 3 








160 


Thr 


Asn 


Asp 


vai 


Ser 


Thr 


Glu Leu 




i / u 










175 


ulU 


17a 1 
val 


Pro 


valll 


Glu 


Ala 


Val Pro 


185 










190 




ulU 


V clX 


PI n 
bill 


Pro 


Lys 


Thr Asp lie 










205 






Arg 


Pro 


vai 


Pro 


Asn 


Glu 


Ser Ala 








9 9 0 
z z u 








Pro 


Ala 


bin 


Ala 


Pro Ala Glu Lys 






*3 "3 ^ 








240 


urn 


Lys 


Ala 


vai 


Ala Asp Thr Thr 




2oU 










255 


Ser 


Tyr 


TV ~| _ 

Ala 


Pro 


Asn 


His 


Ala Tyr 


zoj 










270 




Pro 


i?in 


Thr 


Aia 


Ala 


Phe 


Lys Glu 










285 






inr 


Ser 


pne 


ber 


Gly Tyr Arg Pro 
















Gly 


Leu 


Ala 


lie 


Asp 


Phe 


Met Val 






315 








320 


Gin 


Val 


Ala 


Gin 


Tyr 


Ala 


lie Asp 




330 










335 


Tyr 


Val 


lie 


Trp 


Lys 


Gin Arg Phe 


345 










350 




Gly 


Pro 


Ala 


Tyr 


Thr 


Trp 


Asn Pro 










365 






Glu 


Asn 


His 


Tyr 


Asp His 


Val His 








380 









<210> 3 

<211> 1182 

<212> DMA 

<213> S. pyogenes 



<400> 3 

atgattatta ctaaaaagag cttatttgtg acaagtgtcg ctttgtcgtt agcacctttg 60 

gcgacagcgc aggcacaaga gtggacacca cgatcggtta cagaaatcaa gtctgaactc 120 

gtcctagttg ataatgtttt ' tacttatata gtaaaatacg gtgacacttt aagcacaatt 180 

gctgaagcaa tggggattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 240 

gacttaattt ttccagacac gatcctaaca gcaaactaca accaacacgg tcaggcaacg 300 

actttgacgg ttcaagcacc tgcttctagt ccatctagcg ttagtcatgt acctagcagt 3 60 

gagccattac cccaagcatc tgccacctct caaccgactg ttcctatggc accatctgcg 420 

acaccatctg atgtcccaac gacaccattc gcatctgcaa agccagatag ttctgtgaca 480 

gcgtcatctg agctcacatc gtcaacgaat gatgtttcga ctgagttgtc tagcgaatca 540 

caaaagcagc cagaagtacc acaagaagca gttccaactc ctaaagcagc tgaaccgact 600 

gaagtcgaac ctaagacaga catctcagaa gacccaactt cagctaatag gcctgtacct 660 

aacgagagtg cttcagaaga agcttcttct gcggccccag cacaagctcc agcagaaaaa 720 

gaagaaacct ctcagatgtt aactgcgcca gcagcacaaa aagctgtagc tgacaccaca 780 

agtgttgcaa cctcaaacgg cctttcttac gctccaaacc atgcctacaa tccaatgaat 840 

gcagggcttc aaccacaaac agcagccttc aaagaagaag tggcttctgc ctttggtatt 900 
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acgtcattta gtggttaccg tccaggagat ccaggagatc atggtaaagg attagccatt 960 

gactttatgg taccggttag ctctacgctt ggtgatcaag ttgctcaata tgccattgac 1020 

catatggcag agcgtggtat ttcatacgtt atttggaaac agcgattcta tgcgccattt 1080 

gcaagtattt acggaccagc ctacacatgg aaccccatgc cagatcgcgg cagtattaca 1140 

gaaaaccatt atgatcatgt tcatgtctcc tttaatgctt aa 1182 

<210> 4 

<211> 393 

<212> PRT 

<213> S. pyogenes 



<400> 4 



Met lie 


lie 


Thr 


Lys 


Lys 


Ser 


Leu 


Phe 


Val 


mr 


Ser 


Val Ala Leu 


OCX 


i 






5 










10 






15 




Leu Ala 


Pro 


Leu 


Ala 


Thr 


Ala Gin Ala , Gin 


bill 


Trp Thr Pro Arg 


Ser 






2 0 










25 








30 




Va 1 Tnr 


Glu 


lie 


Lys 


Ser 


Glu 


Leu 


Val 


Leu 


Val 


Asp 


Asn Val Phe 


Thr 




35 










40 










45 




Tyr lie 


Val 


Lys 


Tyr Gly 


Asp 


Thr 


Leu 


Ser 


±nr 


He 


Ala Glu Ala 


riet 


*50 










55 










60 






Gly lie 


Asp 


Val 


His 


Val 


Leu Gly Asp 


He 


Asn 


His 


He Ala Asn 


i le 


65 








70 










7 5 






80 


Asp Leu 


lie 


Phe 


Pro Asp 


Thr 


He 


Leu 


Thr 


Ala 


Asn Tyr Asn Gin 


His 








85 










90 






95 




Gly Gin 


Ala 


Thr 


Thr 


Leu 


Thr 


Val 


Gin 


Ala 


Pro 


Ala 


Ser Ser Pro 


Ser 






100 










105 








110 




Ser Val 


Ser 


His 


Val 


Pro 


Ser 


Ser 


Glu 


Pro 


Leu 


Pro 


Gin Ala Ser 


Ala 




115 










120 










125 




inr t»er 


Li ±TL 


Pro 


Thr 


Val 


Pro 


Met 


Ala 


Pro 


Ser 


Ala 


Thr Pro Ser 


Asp 


130 










135 










140 






Val Pro 


Thr 


Thr 


Pro 


Phe 


Ala 


Ser 


Ala 


Lys 


Pro 


Asp 


Ser Ser Val 


Thr 


145 








150 










155 






160 


Al a S e it 


Ser 


ulU 


Leu 


Thr 


Ser 


Ser 


Thr 


Asn 


Asp 


Val 


Ser Thr Glu 


Leu 








165 










170 






175 




Ser Ser 


blU 


Ser 


Gin Lys 


Gin 


Pro 


Glu 


Val 


Pro 


Gin 


Glu Ala Val 


Pro 






180 










185 








■ • 190 • 




Thr Pro 


Lys 


Ala 


Ala 


Glu 


Pro 


Thr 


Glu 


Val 


blU 


Pro 


Lys Thr Asp 


He 




195 










200 










205 




Ser Glu 


Asp 


Pro 


Thr 


Ser 


Ala 


Asn 


Arg 


Pro 


Val 


Pro 


Asn Glu Ser 


Ala 


210 










215 










220 






Ser Glu 


Glu 


Ala 


Ser 


Ser 


Ala 


Ala 


Pro 


Ala 


Gin 


Ala 


Pro Ala Glu 


Lys 


225 








230 










235 






240 


Glu Glu 


Thr 


Ser 


Gin 


Met 


Leu 


Thr 


Ala 


Pro 


Ala 


Ala Gin Lys Ala Val 








245 










250 






255 




Ala Asp 


Thr 


Thr 


Ser 


Val 


Ala 


Thr 


Ser 


Asn 


Gly 


Leu 


Ser Tyr Ala 


Pro 






260 










265 








270 




Asn His 


Ala 


Tyr 


Asn 


Pro 


Met 


Asn 


Ala Gly 


Leu 


Gin 


Pro Gin Thr 


Ala 




275 










280 










285 




Ala Phe 


Lys 


Glu 


Glu 


Val 


Ala 


Ser 


Ala 


Phe 


Gly 


He 


Thr Ser Phe 


Ser 


290 










2 95 










300 






Gly Tyr 


Arg 


Pro 


Gly 


Asp 


Pro 


Gly 


Asp 


His 


Gly 


Lys 


Gly Leu Ala 


He 


305 








310 










315 






320 


Asp Phe 


Met 


Val 


Pro 


Val 


Ser 


Ser 


Thr 


Leu 


Gly 


Asp 


Gin Val Ala 


Gin 








325 










330 






335 




Tyr Ala 


lie 


Asp 


His 


Met 


Ala 


Glu 


Arg 


Gly 


He 


Ser 


Tyr Val He 


Trp 






340 










345 








350 




Lys Gin 


Arg 


Phe 


Tyr 


Ala 


Pro 


Phe 


Ala 


Ser 


He 


Tyr Gly Pro Ala Tyr 




355 










360 










365 




Thr Trp 


Asn 


Pro 


Met 


Pro 


Asp 


Arg 


Gly Ser 


He 


Thr 


Glu Asn His 


Tyr 


370 










3 75 










380 
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Asp His Val His Val Ser Phe Asn Ala 
385 390 

<210> 5 
<211> 1170 
<212> DNA 
<213> S. pyogenes 



<400> 5 

atgattatta 

gcgacagcgc 

gtcctagttg 

gctgaagcaa 

gacctaattt 

aatttgacgg 

gagccattac 

acaccatctg 

gcgtcatctg 

caaaagcagc 

gaagtcgaac 

aacgagagtg 

gaagaaacct 

tcaaatggcc 

ccacaaacag 

ggtfcaccgtc 

cctgaaaatt 

cgtggtattt 

ggaccagcct 

gatcatgttc 



ctaaaaagag 
aggcacaaga 
ataatgtttt 
tggggattga 
ttccagacac 
ttcaagcacc 
cccaagcatc 
atgtcccaac 
agctcacatc 
cagaagtacc 
ctaagacaga 
cttcagaaga 
ctgcgccagc 
tttcttacgc 
cagccttcaa 
caggtgatcc 
ctgctcttgg 
catacgttat 
acacatggaa 
atgtctcctt 



cttatttgtg 
gtggacacca 
tacttatact 
tgtgcatgtc 
gatcctaaca 
tgcttctagt 
tgccacctct 
gacaccattc 
gtcaacgaat 
acaagaagca 
catctcagaa 
agtttcttct 
agcacaaaaa 
tccaaaccat 
agaagaagtg 
aggagatcat 
tgatcaagtt 
ttggaaacag 
ccccatgcca 
taatgcttaa 



acaagtgtcg 
cgatcggtta 
gtaaaatacg 
ttaggagata 
gcaaactaca 
ccagctagcg 
caaccgactg 
gcatctgcaa 
gatgtttcga 
gttccaactc 
gccccaactt 
gcggccccag 
gctgtagctg 
gcctacaatc 
gcttctgcct 
ggtaaaggtt 
gctcaatatg 
cgattctatg 
gatcgcggca 



ctttgtcgtt 
cagaaatcaa 
gtgacacttt 
ttaatcatat 
atcaacacgg 
ttagtcatgt 
ttcctatggc 
agccagatag 
ctgagttgtc 
ctaaagcagc 
cagctaatag 
cacaagcccc 
acaccacaag 
caatgaatgc 
ttggtattac 
tggccattga 
ccattgacca 
cgccatttgc 
gtattacaga 



agtacctttg 
gtctgaactc 
aagcacaatt 
tgctaatatt 
tcaggcaacg 
acctagcagt 
accacctgcg 
ttctgtgaca 
tagcgaatca 
tgaaacgact 
gcctgtacct 
agcagaaaaa 
tgttgcaacc 
agggcttcaa 
gtcatttagt 
ttttatggtg 
tatggcagag 
aagtatttac 
aaaccattat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1170 



<210> 6 

<211> 389 

<212> PRT 

<213> S. pyogenes 



<400> 6 



Met 


lie 


He 


Thr 


Lys 


Lys 


Ser 


Leu 


Phe 


Val 


Thr 


Ser 


Val 


Ala 


Leu 


Ser 










• 5. 










10 - 










15 • 




Leu 


Val 


Pro 


Leu 


Ala 


Thr 


Ala 


Gin 


Ala 


Gin 


Glu 


Trp 


Thr 


Pro 


Arg 


Ser 








20 










25 










30 






Val 


Thr 


Glu 


He 


Lys 


Ser 


Glu 


Leu 


val 


Leu 


Val 


Asp 


Asn 


Val 


Phe 


Thr 






35 










40 










45 








Tyr 


Thr 


Val 


Lys 


Tyr 


Gly 


Asp 


Thr 


Leu 


Ser 


Thr 


He 


Ala 


Glu 


Ala 


Met 




50 










55 










60 










Gly 


He 


Asp 


Val 


His 


Val 


Leu 


Gly 


Asp 


He 


Asn 


His 


He 


Ala 


Asn 


lie 


65 










70 










75 










80 


Asp 


Leu 


He 


Phe 


Pro 


Asp 


Thr 


He 


Leu 


Thr 


Ala 


Asn 


Tyr 


Asn 


Gin 


His 










85 










90 










95 




Gly 


Gin 


Ala 


Thr 


Asn 


Leu 


Thr 


Val 


Gin 


Ala 


Pro 


Ala 


Ser 


Ser 


Pro 


Ala 








100 










105 










110 






Ser 


Val 


Ser 


His 


Val 


Pro 


Ser 


Ser 


Glu 


Pro 


Leu 


Pro 


Gin 


Ala 


Ser 


Ala 






115 










120 










125 








Thr 


Ser 


Gin 


Pro 


Thr 


Val 


Pro 


Met 


Ala 


Pro 


Pro 


Ala 


Thr 


Pro 


Ser 


Asp 




130 










135 










140 










Val 


Pro 


Thr 


Thr 


Pro 


Phe 


Ala 


Ser 


Ala 


Lys 


Pro 


Asp 


Ser 


Ser 


Val 


Thr 


145 










150 










155 










160 


Ala 


Ser 


Ser 


Glu 


Leu 


Thr 


Ser 


Ser 


Thr 


Asn 


Asp 


Val 


Ser 


Thr 


Glu 


Leu 










165 










170 










175 




Ser 


Ser 


Glu 


Ser 


Gin 


Lys 


Gin 


Pro 


Glu 


Val 


Pro 


Gin 


Glu 


Ala 


Val 


Pro 








180 










185 










190 






Thr 


Pro 


Lys 


Ala 


Ala 


Glu 


Thr 


Thr 


Glu 


Val 


Glu 


Pro 


Lys 


Thr 


Asp 


He 



195 



200 



205 
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Ser 


Glu 


Ala 


Pro Thr 


Ser 


Ala 


Asn 


Arg 


Pro 


Val 


Pro 


Asn 


Glu 


Ser 


Ala 




210 








215 










220 










Ser 


Glu 


Glu 


Val Ser 


Ser 


Ala 


Ala 


Pro 


Ala 


Gin 


Ala 


Pro 


Ala 


Glu 


Lys 


225 








230 










235 










240 


Glu 


Glu 


Thr 


Ser Ala 


Pro 


Ala 


Ala 


Gin 


Lys 


Ala 


Val 


Ala 


Asp 


Thr 


Thr 








245 










250 










255 




Ser 


Val 


Ala 


Thr Ser 


Asn 


Gly 


Leu 


Ser 


Tyr 


Ala 


Pro 


Asn 


His 


Ala 


Tyr 








260 








265 










270 






Asn 


Pro 


Met 


Asn Ala 


Gly 


Leu 


Gin 


Pro 


Gin 


Thr 


Ala 


Ala 


Phe 


Lys 


Glu 






275 








280 










285 








Glu 


Val 


Ala 


Ser Ala 


Phe 


Gly 


lie 


Thr 


Ser 


Phe 


Ser 


Gly 


Tyr 


Arg 


Pro 




290 








295 










3 0 0 










Gly 


Asp 


Pro 


Gly Asp His Gly 


Lys 


Gly 


Leu 


Ala 


lie 


Asp 


Phe 


Met 


Val 


305 








310 










315 










320 


Pro 


Glu 


Asn 


Ser Ala 


Leu 


Gly 


Asp 


Gin 


Val 


Ala 


Gin 


Tyr 


Ala 


lie 


Asp 








325 










330 










335 




His 


Met 


Ala 


Glu Arg Gly lie 


Ser 


Tyr 


Val 


lie 


Trp 


Lys 


Gin 


Arg 


Phe 








340 








345 










350 






Tyr 


Ala 


Pro 


Phe Ala 


Ser 


lie 


Tyr 


Gly 


Pro 


Ala 


Tyr 


Thr 


Trp 


Asn 


Pro 






355 








360 










365 








Met 


Pro 


Asp 


Arg Gly Ser 


lie 


Thr 


Glu 


Asn 


His 


Tyr 


Asp 


His 


Val 


His 




370 








375 










380 










Val 


Ser 


Phe 


Asn Ala 

























385 



<210> 7 

<211> 1149 

<212> DNA 

<213> S. pyogenes 



<400> 7 

atgattatta ctaaaaagag cttatttgtg acaagtgtcg ctttgtcgtt agcacctttg 60 

gcgacagcgc aggcacaaga gtggacacca cgatcggtta cagaaatcaa gtctgaactc 120 

gtcctagttg ataatgtttt tacttataca gtaaaatacg gtgacacttt aagcacaatt 180 

gctgaagcaa tggggattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 240 

gacttaattt ttccagacac gatcctaaca gcaaactaca atcaacacgg tcaggcaacg 300 

actttgacgg ttcaagcacc tgcttctagt ccagctagcg ttagtcatgt acctagcagt 360 

gagccattac.cccaagcatc - tgccaoctct caaecgactg .ttcctatggc aecatctgcg -420 

acaccattag catctgcaaa gccagatagt tctgtgacag cgtcatctga gctcacatcg 480 

tcaacgaatg atgtttcgac tgagtcgtct agcgaatcac aaaagcagcc agaagtacca 540 

caagaagcag ttccaactcc taaagcagct gaaacgactg aagtcgaacc taagacagac 600 

atctcagaag acccaacttc agctaatagg cctgtaccta acgagagtgc ttcagaagaa 660 

gtttcttctg cggccccagc acaagcccca gcagaaaaag aagaaacctc tgcgccagca 720 

gcacaaaaag ctgtagctga caccacaagt gttgcaacct caaacggcct ttcttacgct 780 

ccaaaccatg cctacaatcc aatgaatgca gggcttcaac cacaaacagc agccttcaaa 840 

gaagaagtgg cttctgcctt tggtattacg tcatttagtg gttaccgtcc aggtgaccca 900 

ggagatcatg gtaaaggttt ggccattgat tttatggtgc ctgaaaattc tgctcttggt 960 

gatcaagttg ctcaatatgc cattgaccat atggcagagc gtggtatttc atacgttatt 1020 

tggaaacagc gattctatgc gccatttgca agtatttacg gaccagctta cacatggaac 1080 

cccatgccag atcgcggcag tattacagaa aaccattatg atcatgttca tgtctccttt 1140 

aatgcttaa 1149 



<210> 8 

<211> 3B2 

<212> PRT 

<213> S. pyogenes 



<400> 8 

Met lie He Thr Lys Lys Ser Leu Phe Val Thr Ser Val Ala Leu Ser 

15 10 15 

Leu Ala Pro Leu Ala Thr Ala Gin Ala Gin Glu Trp Thr Pro Arg Ser 
20 . 25 30 
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Val 


Thr 


Glu 


He 


Lys 


Ser 


Glu 


Leu 


Val 


Leu 


Val 


Asp 


Asn 


Val 


Phe 


Thr 






35 










40 










45 








Tvr 


Thr 


Val 


Lvs 


Tyr Gly Asp 


Thr 


Leu 


Ser 


Thr 


He 


Ala 


Glu Ala 


Met 




50 










55 










60 










Qlv 


He 


Asp 


Val 


His 


Val 


Leu 


Glv 


Asp 


He 


Asn 


His 


He 


Ala 


Asn 


He 


65 










70 










75 










80 




Leu 


He 


Phe 


Pro Asp Thr 


He 


Leu 


Thr 


Ala 


Asn Tyr Asn Gin 


His 










85 










90 










y d 




Glv 


Gin 


Ala 


Thr 


Thr 


Leu 


Thr 


Val 


Gin 


Ala 


Pro 


Ala 


Ser 


Ser 


Pro 


Ala 








100 










105 










J. J, u 






Ser 


Val 


Ser 


His 


Val 


Pro 


Ser 


Ser 


Glu 


Pro 


Leu 


Pro 


Gin 


Ala 


Ser 


Ala 






115 










120 










IOC 








Thr 


Ser 


Gin 


Pro 


Thr 


Val 


Pro 


Met 


Ala 


Pro 


Ser 


Ala 


Thr 


Pro 


Leu 


Ala 




13 0 










135 










140 










Ser 


Ala 


Lys 


Pro Asp 


Ser 


Ser 


Val 


Thr 


Ala 


Ser 


Ser 


Glu 


Leu 


Thr 


Ser 


145 










150 










155 










160 


S&Y 


Thr 


Asn 


Asp 


Val 


Ser 


Thr 


Glu 


Ser 


Ser 


Ser 


Glu 


Ser 


Gin 


Lys 


Gin 










16.5 










170 










175 




Pro 


Glu 


Val 


Pro 


Gin 


Glu 


Ala 


Val 


Pro 


Thr 


Pro 


Lys 


Ala 


Ala 


Glu 


Thr 








180 










185 










190 






Thr 


Glu 


Val 


Glu 


Pro 


Lys 


Thr 


Asp 


He 


Ser 


Glu 


Asp 


Pro 


Thr 


Ser 


Ala 






195 










200 










205 








Asn 






Val 


Pro 


Asn 


Glu 


Ser 


Ala 


Ser 


Glu 


Glu 


Val 


Ser 


Ser 


Ala 




210 










215 










220 










Ala 


Piro 


Ala 


Gin 


Ala 


Pro 


Ala 


Glu 


Lys 


Glu 


Glu 


Thr 


Ser 


Ala 


Pro 


Ala 


225 










230 










235 










240 


Ala 


Gin 


Lys 


Ala Val Ala Asp 


Thr 


Thr 


Ser 


Val 


Ala 


Thr 


Ser 


Asn 


Gly 










245 










250 










255 




Leu 


Ser 


Tyr 


Ala 


Pro Asn His 


Ala 


Tyr 


Asn 


Pro 


Met Asn Ala Gly 


Leu 








260 










265 










270 






Gin 


Pro 


Gin 


Thr 


Ala 


Ala 


Phe 


Lvs 


Glu 


Glu 


Val 


Ala 


Ser 


Ala 


Phe 


Glv 
















280 










285 








lie 


Thr 


Ser 


Phe 


Ser Gly Tyr 


Arg 


Pro 


Glv 


Asp 


Pro Gly Asp His 


Glv 




^ zT U 










295 










300 










Lys 


Glv 


Leu 


Ala 


He 


Asp 


Phe 


Met 


Val 


Pro 


Glu 


Asn 


Ser 


Ala 


Leu 


Glv 


305 










310 










315 










320 


Asp 


Gin 


Val 


Ala 


Gin Tyr Ala 


He 


Asp 


His 


Met 


Ala Glu Arg Gly 


He 










325 










330 










335 




Ser 


Tyr 


Val 


He 


Trp 


Lys 


Gin 


Arg 


Phe 


Tyr 


Ala 


Pro 


Phe 


Ala 


Ser 


He 








340 










345 










350 






Tyr 


Gly 


Pro 


Ala Tyr Thr Trp 


Asn 


Pro 


Met 


Pro 


Asp Arg Gly 


Ser 


He 






355 










360 










365 








Thr 


Glu 


Asn 


His 


Tyr 


Asp 


His 


Val 


His 


Val 


Ser 


Phe 


Asn 


Ala 







370 375 380 



<210> 9 
<211> 1095 
<212> DNA 
<213> S. pyogenes 



<400> 9 

caagagtgga caccacgatc ggttacagaa atcaagtctg aactcgtcct agttgataat 60 

gtttttactt atactgtaaa atacggtgac actttaagca caattgctga agcaatggga 120 

attgatgtgc atgtcttagg agatattaat catattgcta atattgactt aatttttcca 180 

gacacgatcc taacagccaa ctacaaccaa cacggtcagg caacgacttt gacggttcaa 24 0 

gcgcctgctt ctagtccagc tagcgttagt catgtaccta gcagtgagce attaccccaa 300 

gcatctgcca cctctcaatc gactgttcct atggcaccat ctgcgacacc atctgatgtc 360 

ccaacgacac cattcgcatc tgcaaagcca gatagttctg tgacagcgtc atctgagctc 420 

acatcgtcaa cgaatgatgt ttcgactgag ttgtctagcg aatcacaaaa gcagccagaa 480 

gtaccacaag aagcagttcc aactcctaaa gcagctgaaa cgactgaagt cgaacctaag 54 0 

acagacatct cagaggattc aacttcagct aataggcctg tacctaacga gagtgcttca 600 

gaagaagttt cttctgcggc cccagcacaa gccccagcag aaaaagaaga aacctctgcg 660 
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ccagcagcac aaaaagctgt agctgacacc acaagtgttg caacctcaaa tggcctttct 720 

tacgctccaa accatgccta caatccaatg aatgcagggc ttcaaccaca aacagcagcc 780 

ttcaaagaag aagtggcttc tgcctttggt attacgtcat ttagtggtta ccgtccaggt 84 0 

gatccaggag atcatggtaa aggtttggcc attgatttta tggtgcctga aaattctgct 900 

cttggtgatc aagttgctca atatgccatt gaccatatgg cagagcgtgg tatttcatac 960 

gttatttgga aacagcgatt ctatgcgcca tttgcaagta tttacggacc agcctacaca 1020 

tggaacccca tgccagatcg cggcagtatt acagaaaacc attatgatca tgttcatgtc 1080 

tcctttaatg cttaa 1095 

<210> 10 

<211> 364 

<212> PRT 

<213> S. pyogenes 

<400> 10 

Gin Glu Trp Thr Pro Arg Ser Val Thr Glu lie Lys Ser Glu Leu Val 

15 10 15 

Leu Val Asp Asn Val Phe Thr Tyr Thr Val Lys Tyr Gly Asp Thr Leu 

20 25 30 

Ser Thr He Ala Glu Ala Met Gly He Asp Val His Val Leu Gly Asp 

35 40 45 

He Asn His He Ala Asn He Asp Leu He Phe Pro Asp Thr He Leu 

50 55 60 

Thr Ala Asn Tyr Asn Gin His Gly Gin Ala Thr Thr Leu Thr Val Gin 
65 70 75 80 

Ala Pro Ala ser Ser Pro Ala Ser Val Ser' His Val Pro Ser Ser Glu 

85 90 95 

Pro Leu Pro Gin Ala Ser Ala Thr Ser Gin Ser Thr Val Pro Met Ala 

100 105 110 

Pro Ser Ala Thr Pro Ser Asp Val Pro Thr Thr Pro Phe Ala Ser Ala 

115 120 125 

Lys Pro Asp Ser Ser Val Thr Ala Ser Ser Glu Leu Thr Ser Ser Thr 

130 135 140 

Asn Asp Val Ser Thr Glu Leu Ser Ser Glu Ser Gin Lys Gin Pro Glu 
145 150 155 160 

Val Pro Gin Glu Ala Val Pro Thr Pro Lys Ala Ala Glu Thr Thr Glu 

165 170 175 

Val .Glu Pro Lys. Thr Asp lie . Ser Glu. Asp. Ser Thr. Ser . .Ala Asn .Arg . 

180 185 190 

Pro Val Pro Asn Glu Ser Ala Ser Glu Glu Val Ser Ser Ala Ala Pro 

195 200 205 

Ala Gin Ala Pro Ala Glu Lys Glu Glu Thr Ser Ala Pro Ala Ala Gin 

210 215 220 

Lys Ala Val Ala Asp Thr Thr Ser Val Ala Thr Ser Asn Gly Leu Ser 
225 230 235 240 

Tyr Ala Pro Asn His Ala Tyr Asn Pro Met Asn Ala Gly Leu Gin Pro 

245 250 255 

Gin Thr Ala Ala Phe Lys Glu Glu Val Ala Ser Ala Phe Gly He Thr 

260 265 270 

Ser Phe Ser Gly Tyr Arg Pro Gly Asp Pro Gly Asp His Gly Lys Gly 

275 280 285 

Leu Ala He Asp Phe Met Val Pro Glu Asn Ser Ala Leu Gly Asp Gin 

290 295 300 

Val Ala Gin Tyr Ala He Asp His Met Ala Glu Arg Gly He Ser Tyr 
305 310 315 320 

Val He Trp Lys Gin Arg Phe Tyr Ala Pro Phe Ala Ser He Tyr Gly 

325 330 335 

Pro Ala Tyr Thr Trp Asn Pro Met Pro Asp Arg Gly Ser He Thr Glu 

340 345 350 

Asn His Tyr Asp His Val His Val Ser Phe Asn Ala 
355 360 

<210> 11 
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<211> 1106 

<212> DNA 

<213> S. pyogenes 



<400> 11 

caagagtgga 

gtttttactt 

attgatgtgc 

gacacgatcc 

gcacctgctt 

gcatctgcca 

ccaacgacac 

acatcgtcaa 

gtaccacaag 

acagacatct 

aagaagcttc 

tgttaactgc 

acggcctttc 

aaacagcagc 

accgtccagg 

ttagctctac 

gtatttcata 

cagcctacac 

atgttcatgt 



caccacgatc 
atatagtaaa 
atgtcttagg 
taacagcaaa 
ctagtccatc 
cctctcaacc 
cattcgcatc 
cgaatgatgt 
aagcagttcc 
cagaagaccc 
ttctgcggcc 
gccagcagca 
ttacgctcca 
cttcaaagaa 
agatccagga 
gcttggtgat 
cgttatttgg 
atggaacccc 
ctcctttaat 



ggttacagaa 
atacggtgac 
agatattaat 
ctacaaccaa 
tagcgttagt 
gactgttcct 
tgcaaagcca 
ttcgactgag 
aactcctaaa 
aacttcagct 
ccagcacaag 
caaaaagctg 
aaccatgcct 
gaagtggctt 
gatcatggta 
caagttgctc 
aaacagcgat 
atgccagatc 
gcttaa 



atcaagtctg 
actttaagca 
catattgcta 
cacggtcagg 
catgtaccta 
atggcaccat 
gatagttctg 
ttgtctagcg 
gcagctgaac 
aataggcctg 
ctccagcaga 
tagctgacac 
acaatccaat 
ctgcctttgg 
aaggattagc 
aatatgccat 
tctatgcgcc 
gcggcagtat 



aactcgtcct 
caattgctga 
atattgactt 
caacgacttt 
gcagtgagcc 
ctgcgacacc 
tgacagcgtc 
aatcacaaaa 
cgactgaagt 
acctaacgag 
aaaagaagaa 
cacaagtgtt 
gaatgcaggg 
tattacgtca 
cattgacttt 
tgaccatatg 
atttgcaagt 
tacagaaaac 



agttgataat 
agcaatgggg 
aatttttcca 
gacggttcaa 
attaccccaa 
atctgatgtc 
atctgagctc 
gcagccagaa 
cgaacctaag 
agtgcttcag 
acctctcaga 
gcaacctcaa 
cttcaaccac 
tttagtggtt 
atggtaccgg 
gcagagcgtg 
atttacggac 
cattatgatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1106 



<210> 12 

<211> 368 

<212> PRT 

<213> S. pyogenes 



<400> 12 



Gin 


Glu 


Trp 


Thr 


Pro 


Arg 


Ser 


Val 


Thr 


Glu 


He 


Lys 


Ser 


Glu 


Leu 


Val 


1 








5 










10 










15 




Leu 


Val 


Asp 


Asn 


Val 


Phe 


Thr 


Tyr 


He 


Val 


Lys 


Tyr 


Gly Asp 


Thr 


Leu 








20 










25 










30 






Ser 


Thr 


lie 
35 


Ala 


Glu 


Ala 


Met 


Gly 
40 


He 


Asp 


Val 


His 


Val 
45 


Leu 


Gly 


Asp 


lie Asn- His 


lie 


Ala 


Asn 


Tie 


Asp 


Leu 


lie 


Phe Pro 


Asp Thr 


lie 


Leu 




50 










55 










60 










Thr 


Ala 


Asn 


Tyr 


Asn 


Gin 


His 


Gly 


Gin 


Ala 


Thr 


Thr 


Leu 


Thr 


Val 


Gin 


65 










70 










75 










80 


Ala 


Pro Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


Val 


Ser 


His 


Val 


Pro 


Ser 


Ser 


Glu 










85 










90 










95 




Pro 


Leu 


Pro 


Gin 
100 


Ala 


Ser 


Ala 


Thr 


Ser 
105 


Gin 


Pro 


Thr 


Val 


Pro 
110 


Met 


Ala- 


Pro 


Ser 


Ala 
115 


Thr 


Pro 


Ser 


Asp 


Val 
120 


Pro 


Thr 


Thr 


Pro 


Phe 
125 


Ala 


Ser 


Ala 


Lys 


Pro Asp 


Ser 


Ser 


Val 


Thr 


Ala 


Ser 


Ser 


Glu 


Leu 


Thr 


Ser 


Ser 


Thr 




130 










135 










140 










Asn 


Asp Val 


Ser 


Thr 


Glu 


Leu 


Ser 


Ser 


Glu 


Ser 


Gin 


Lys 


Gin 


Pro 


Glu 


145 










150 










155 










160 


Val 


Pro 


Gin 


Glu 


Ala 
165 


Val 


Pro 


Thr 


Pro 


Lys 
170 


Ala' 


Ala 


Glu 


Pro 


Thr 
175 


Glu 


Val 


Glu 


Pro 


Lys 
180 


Thr 


Asp 


He 


Ser 


Glu 
185 


Asp 


Pro 


Thr 


Ser 


Ala 
190 


Asn 


Arg 


Pro 


Val 


Pro 
195 


Asn 


Glu 


Ser 


Ala 


Ser 
200 


Glu 


Glu 


Ala 


Ser 


Ser 
205 


Ala 


Ala 


Pro 


Ala 


Gin 
210 


Ala 


Pro 


Ala 


Glu 


Lye 
215 


Glu 


Glu 


Thr 


Ser 


Gin 
220 


Met 


Leu 


Thr 


Ala 


Pro 


Ala 


Ala 


Gin 


Lys 


Ala 


Val 


Ala 


Asp 


Thr 


Thr 


Ser 


Val 


Ala 


Thr 


ser 


225 










230 










235 










240 


Asn 


Gly 


Leu 


Ser 


Tyr 
245 


Ala 


Pro 


Asn 


His 


Ala 
250 


Tyr 


Asn 


Pro 


Met 


Asn 
255 


Ala 
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Gly 


Leu 


Gin Pro Gin 


Thr 


Ala Ala 


Phe 


Lys 


Glu 


Glu 


Val 


Ala Ser Ala 






260 






265 










270 


Phe 


Gly 


He Thr Ser 


Phe 


Ser Gly 


Tyr 


Arg 


Pro 


Gly Asp 


Pro Gly Asp 






275 




280 










285 




His 


Gly 


Lys Gly Leu 


Ala 


He Asp 


Phe 


Met 


Val 


Pro 


Val 


Ser Ser Thr 




290 






295 








300 






Leu 


Gly 


Asp Gin Val 


Ala 


Gin Tyr 


Ala 


lie 


Asp 


His 


Met 


Ala Glu Arg 


305 






310 








315 






320 


Gly 


He 


Ser Tyr Val 


He 


Trp Lys 


Gin 


Arg 


Phe 


Tyr 


Ala 


Pro Phe Ala 






325 








330 








335 


Ser 


He 


Tyr Gly Pro Ala 


Tyr Thr 


Trp 


As ii 


Pro 


Met 


Pro 


Asp Arg Gly 






340 






345 










350 


Ser 


He 


Thr Glu Asn 


His 


Tyr Asp 


His 


Val 


His 


Val 


Ser 


Phe Asn Ala 






355 




360 










365 





<210> 13 

<211> 1095 

<212> DNA 

<213> S. pyogenes 



<400> 13 

caagagtgga caccacgatc ggttacagaa atcaagtctg aactcgtcct agttgataat 60 

gtttttactt atactgtaaa atacggtgac actttaagca caattgctga agcaatgggg 120 

attgatgtgc atgtcttagg agatattaat catattgcta atattgacct aatttttcca 180 

gacacgatcc taacagcaaa ctacaatcaa cacggtcagg caacgaattt gacggttcaa 240 

gcacctgctt ctagtccagc tagcgttagt catgtaccta gcagtgagcc attaccccaa 300 

gcatctgcca cctctcaacc gactgttcct atggcaccac ctgcgacacc atctgatgtc 3 60 

ccaacgacac cattcgcatc tgcaaagcca gatagttctg tgacagcgtc atctgagctc 420 

acatcgtcaa cgaatgatgt ttcgactgag ttgtctagcg aatcacaaaa gcagccagaa 480 

gtaccacaag aagcagttcc aactcctaaa gcagctgaaa cgactgaagt cgaacctaag 540 

acagacatct cagaagcccc aacttcagct aataggcctg tacctaacga gagt'gcttca 600 

gaagaagttt cttctgcggc cccagcacaa gccccagcag aaaaagaaga aacctctgcg 660 

ccagcagcac aaaaagctgt agctgacacc acaagtgttg caacctcaaa tggcctttct 720 

tacgctccaa accatgccta caatccaatg aatgcagggc ttcaaccaca aacagcagcc 780 

ttcaaagaag aagtggcttc tgcctttggt attacgtcat ttagtggtta ccgtccaggt 840 

gatccaggag atcatggtaa aggtttggcc attgatttta tggtgcctga aaattctgct 900 

cttggtgatc aagttgctca atatgccatt gaccatatgg cagagcgtgg tatttcatac 960 
gttatttgga aacagcgatt ctatgcgcca tttgcaagta tttacggacc • agcctacaca • 102.0 • 

tggaacccca tgccagatcg cggcagtatt acagaaaacc attatgatca tgttcatgtc 1080 

tcctttaatg cttaa 1095 



<210> 14 

<211> 364 

<212> PRT 

<213> S. pyogenes 



<400> 14 



Gin 


Glu 


Trp 


Thr 


Pro 


Arg 


Ser 


Val 


Thr 


Glu 


He 


Lys 


Ser 


Glu 


Leu 


Val 


. 1 








5 










10 










15 




Leu 


Val 


Asp 


Asn 
20 


Val 


Phe 


Thr 


Tyr 


Thr 
25 


Val 


Lys 


Tyr 


Gly 


Asp 
30 


Thr 


Leu 


Ser 


Thr 


lie 
35 


Ala 


Glu 


Ala 


Met 


Gly 
40 


lie 


Asp 


val 


His 


Val 
45 


Leu 


Gly 


Asp 


He 


Asn 
50 


His 


He 


Ala 


Asn 


He 
55 


Asp 


Leu 


lie 


Phe 


Pro 
60 


Asp 


Thr 


lie 


Leu 


Thr 


Ala 


Asn 


Tyr 


Asn 


Gin 


His 


Gly 


Gin 


Ala 


Thr 


Asn 


Leu 


Thr 


Val 


Gin 


65 










70 










75 










80 


Ala 


Pro 


Ala 


Ser 


Ser 
85 


Pro 


Ala 


Ser 


Val 


Ser 
90 


His 


Val 


Pro 


Ser 


Ser 
95 


Glu 


Pro 


Leu 


Pro 


Gin 
100 


Ala 


Ser 


Ala 


Thr 


Ser 
105 


Gin 


Pro 


Thr 


Val 


Pro 
110 


Met 


Ala 


Pro 


Pro 


Ala 
115 


Thr 


Pro 


Ser 


Asp 


Val 
120 


Pro 


Thr 


Thr 


Pro 


Phe 
125 


Ala 


Ser 


Ala 
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Lys 


Pro 


Asp 


Ser Ser Val Thr Ala 


Ser 


Ser 


Glu 


Leu 


Thr 


Ser 


Ser 


Thr 




130 




lie 
135 








i a n 
14 (J 










Asn 


Asp 


Val 


Ser Thr Glu Leu Ser 


Ser 


Glu 


Ser 


Gin 


Lys 


Gin 


Pro 


vj±U 


145 






150 






155 










160 


Val 


Pro 


Gin 


Glu Ala Val Pro Thr 


Pro 


Lys 


Ala 


Ala 


Glu 


Thr 


Thr 


Glu 








165 




170 










175 




Val 


Glu 


Pro 


Lys Thr Asp He Ser 


Glu 


Ala 


Pro 


Thr 


Ser 


Ala 


Asn 


Arg 








180 


185 










190 






Pro 


Val 


Pro 


Asn Glu Ser Ala Ser 


Glu 


Glu 


Val 


Ser 


Ser 


Ala 


Ala 


Pro 






195 


200 










205 








Ala 


Gin 


Ala 


Pro Ala Glu Lys Glu 


Glu 


Thr 


Ser 


Ala 


Pro 


Ala 


Ala 


Gin 




210 




215 








220 










Lys 


Ala 


Val 


Ala Asp Thr Thr Ser 


Val 


Ala 


Thr 


Ser 


Asn 


Gly 


Leu 


Ser 


225 






230 






235 










240 


Tyr 


Ala 


Pro 


Asn His Ala Tyr Asn 


Pro 


Met 


Asn 


Ala 


Gly 


Leu 


Gin 


Pro 








245 




250 










255 




Gin 


Thr 


Ala 


Ala Phe Lys Glu Glu 


Val 


Ala 


Ser 


Ala 


Phe 


Gly 


He 


Thr 








2 60 


265 










270 






Ser 


Phe 


Ser 


Gly Tyr Arg Pro Gly 


Asp 


Pro 


Gly 


Asp 


His 


Gly 


Lys 


Gly 






2 75 


O O A 










0 Q 
z, o O 








Leu 


Ala 


He 


Asp Phe Met Val Pro 


Glu 


Asn 


Ser 


Ala 


Leu 


Gly 


Asp 


Gin 




290 




295 








300 










Val 


Ala 


Gin 


Tyr Ala He Asp His 


Met 


Ala 


Glu 


Arg 


Gly 


He 


Ser 


Tyr 


305 






310 






315 










320 


Val 


He 


Trp 


Lys Gin Arg Phe Tyr 


Ala 


Pro 


Phe 


Ala 


Ser 


He 


Tyr 


Gly 








325 




330 










335 




Pro 


Ala 


Tyr 


Thr Trp Asn Pro Met 


Pro 


Asp 


Arg 


Gly 


Ser 


He 


Thr 


Glu 








340 


345 










350 






Asn 


His 


Tyr 


Asp His Val His Val 


Ser 


Phe 


Asn 


Ala 











355 360 



<210> 15 
<211> 1074 
<212> DNA 
<213> S. pyogenes 



<400> 15 

caagagtgga' 

gtttttactt 

attgatgtgc 

gacacgatcc 

gcacctgctt 

gcatctgcca 

geaaagecag 

tcgactgagt 

actcctaaag 

acttcagcta 

ccagcacaag 

gctgacacca 

aatccaatga 

gcctttggta 

ggtttggcca 



caccacgatc 
atacagtaaa 
atgtcttagg 
taacagcaaa 
ctagtccagc 
cctctcaacc. 
atagttctgt 
egtctagega 
cagctgaaac 
ataggectgt 
ccccagcaga 
caagtgttgc 
atgeaggget 
ttaegtcatt 
ttgattttat 



ggttacagaa 

ataeggtgae 

agatattaat 

ctaca'atcaa ■ 

tagcgttagt 

gactgttcct 

gaeagegtea 

atcacaaaag 

gactgaagtc 

acctaacgag 

aaaagaagaa 

aacctcaaac 

tcaaccacaa 

tagtggttac 

ggtgcctgaa 



atcaagtctg 
actttaagca 
catattgeta 
caeggtcagg 
catgtaccta 
atggcaccat 
tctgagctca 
cagecagaag 
gaacctaaga 
agtgcttcag 
acctctgcgc 
ggectttett 
acagcagcct 
cgtccaggtg 
aattctgetc 



aactcgtcct ■ 


agttgataat • 


60 


caattgetga 


agcaatgggg 


12 0 


atattgactt 


aatttttcca 


180 


caacgacttt 


gacggttcaa ' 


240 


geagtgagee 


attaccccaa 


300 


ctgcgacacc 


attagcatct 


360 


catcgtcaac 


gaatgatgtt 


420 


taccacaaga 


agcagttcca 


480 


cagacatctc 


agaagaccca 


540 


aagaagtttc 


ttctgcggcc 


600 


cagcagcaca 


aaaagctgta 


660 


acgctccaaa 


ccatgcctac 


720 


tcaaagaaga 


agtggcttct 


780 


acccaggaga 


tcatggtaaa 


840 


ttggtgatca 


agttgctcaa 


900 



tatgecattg accatatggc agagcgtggt atttcatacg ttatttggaa acagegatte 960 
tatgegecat ttgeaagtat ttaeggacca gcttacacat ggaaccccat gccagatcgc 1020 
ggcagtatta cagaaaacca ttatgatcat gttcatgtct ectttaatge ttaa 1074 

<210> 16 

<211> 357 

<212> PRT 

<213> S. pyogenes 

l '<400> 16 
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Gin 


Glu 


Trp 


Thr 


Pro Arg Ser Val 


Thr 


Glu 


He 


Lys 


Ser Glu 


Leu Val 


l 








5 




10 








15 


Leu 


Val 


Asp Asn 


Val Phe Thr Tyr 


Thr 


Val 


Lys 


Tyr 


Gly Asp Thr Leu 








20 




25 








30 




Ser 


Thr 


lie 


Ala 


Glu Ala Met Gly 


lie 


Asp 


val 


His 


Val Leu Gly Asp 






35 




40 










45 




lie 


Asn 


His 


lie 


Ala Asn lie Asp 


Leu 


He 


Phe 


Pro 


Asp Thr 


He Leu 




50 






£5 








60 






Thr 


Ala 


Asn Tyr 


Asn Gin His Gly 


Gin 


Ala 


Thr 


Thr 


Leu Thr 


Val Gin 


65 








70 






75 






80 


Ala 


Pro 


Ala 


Ser 


Ser Pro Ala Ser 


Val 


Ser 


His 


Val 


Pro Ser 


Ser Glu 










85 




90 








95 


Pro 


Leu 


Pro 


Gin 


Ala Ser Ala Thr 


Ser 


Gin 


Pro 


Thr 


Val Pro 


Met Ala 








100 




105 








110 




Pro 


Ser 


Ala 


Thr 


Pro Leu Ala Ser 


Ala 


Lys 


Pro 


Asp 


Ser Ser 


Val Thr 






115 




120 










125 




Ala 


Ser 


Ser 


Glu 


Leu Thr Ser Ser 


Thr 


Asn 


Asp 


Val 


Ser Thr 


Glu Ser 




130 






135 








14 0 






Ser 


Ser 


Glu 


Ser 


Gin Lys Gin Pro 


Glu 


Val 


Pro 


Gin 


Glu Ala 


Val Pro 


145 








150 






155 






160 


Thr 


Pro 


Lys 


Ala 


Ala Glu Thr Thr 


Glu 


Val 


Glu 


Pro 


Lys Thr Asp He 










165 




170 








175 


Ser 


Glu 


Asp 


Pro 


Thr Ser Ala Asn 


Arg 


Pro 


Val 


Pro 


Asn Glu 


Ser Ala 








18 0 




185 








190 




Ser 


Glu 


Glu 


Val 


Ser Ser Ala Ala 


Pro 


Ala 


Gin 


Ala 


Pro Ala 


Glu Lys 






195 




200 










205 




Glu 


Glu 


Thr 




Ala Pro Ala Ala 


Gin 


Lys 


Ala 


Val 


Ala Asp Thr Thr 




210 






215 








220 






Ser 


Val 


Ala 


Thr 


Ser Asn Gly Leu 


Ser 


Tvr 


Ala 


Pro 


Asn His 


Ala Tyr 


225 








230 






235 






240 


Asn 


Pro 


Met 


Asn 


Ala Gly Leu Gin 


Pro 


Gin 


Thr 


Ala 


Ala Phe 


Lys Glu 










245 




250 








255 


Glu 


Val 


Ala 


Ser 


Ala Phe Gly lie 


Thr 


Ser 


Phe 


Ser 


Gly Tyr Arg Pro 








260 




265 








270 




Gly 


Asd 


Pro 


Gly 


Asp His Gly Lys 


Gly 


Leu 


Ala 


He 


Asp Phe 


Met Val 






275 




280 










285 




Pro 


Glu 


Asn 


Ser 


Ala Leu Gly Asp 


Gin 


Val 


Ala 


Gin 


Tyr Ala 


He Asp 




.290 






- 29S . -. 








.300 






His 


Met 


Ala 


Glu 


Arg Gly lie Ser 


Tyr 


Val 


He 


Trp 


Lys Gin Arg Phe 


305 








310 






315 






320 


Tyr 


Ala 


Pro 


Phe 


Ala Ser lie Tyr 


Gly 


Pro 


Ala 


Tyr 


Thr Trp Asn Pro 










325 




330 








335 


Met 


Pro 


Asp 


Arg 


Gly Ser lie Thr 


Glu 


Asn 


His 


Tyr 


Asp His Val His 








340 




345 








350 




val 


Ser 


Phe 


Asn 


Ala 















355 



<210> 17 
<211> 1113 
<212> DNA 

<2 13 > S . pneumonia 
<400> 17 

atgaagaaaa gaatgttatt agcgtcaaca gtagccttgt catttgcccc agtattggca 60 

actcaagcag aagaagttct ttggactgca cgtagtgttg agcaaatcca aaacgatttg 120 

actaaaacgg acaacaaaac aagttatacc gtacagtatg gtgatacttt gagcaccatt 180 

gcagaagcct tgggtgtaga tgtcacagtg cttgcgaatc tgaacaaaat cactaatatg 240 

gacttgattt tcccagaaac tgttttgaca acgactgtca atgaagcaga agaagtaaca 3 00 

gaagttgaaa tccaaacacc tcaagcagac tctagtgaag aagtgacaac tgcgacagca 360 

gatttgacca ctaatcaagt gaccgttgat gatcaaactg ttcaggttgc agacctttct 420 

caaccaattg cagaagttac aaagacagtg attgcttctg aagaagtggc accatctacg 480 

ggcacttctg tcccagagga gcaaacgacc gaaacaactc gcccagttga agaagcaact 540 

cctcaggaaa cgactccagc tgagaagcag gaaacacaag caagccctca agctgcatca 600 
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gcagtggaag taactacaac aagttcagaa gcaaaagaag tagcatcatc aaatggagct 660 

acagcagcag tttctactta tcaaccagaa gagacgaaaa taatttcaac aacttacgag 720 

gctccagctg cgcccgatta tgctggactt gcagtagcaa aatctgaaaa tgcaggtctt 780 

caaccacaaa cagctgcctt taaagaagaa attgctaact tgtttggcat tacatccttt 340 

agtggttatc gtccaggaga cagtggagat cacggaaaag gtttggctat cgactttatg 900 

gtaccagaac gttcagaatt aggggataag attgcggaat atgctattca aaatatggcc 960 

agccgtggca ttagttacat catctggaaa caacgtttct atgctccatt cgatagcaaa 1020 

tatgggccag ctaacacttg gaacccaatg ccagaccgtg gtagtgtgac agaaaatcac 1080 

tatgatcacg ttcacgtttc aatgaatgga taa 1113 

<210> 18 
<211> 370 
<212> PRT 

<213> S. pneumonia 
<400> 18 



Met ;Lys 


Lys 


Arg Met 


Leu 


Leu 


Ala 


Ser 


Thr 


Vai 


Ala 


Leu 


Ser 


Phe 


Ala 










5 










10 










15 




PrO 


vai 


Leu 


Ala 


Thr 


Gin 


Ala 


Glu 


Glu 


Vai 


Leu 


lip 


Thr 


Ala 


ZVtct 


Ser 








20 










D R 
d 










30 






vai 


blU 


m n 


He 


Gin 


Asn 


ASp 


J_iGU 


Thr 


y s 


Thr 


Asp 


Asn 


Lys 


Thr 


Ser 






3b 




















A Z. 
ft 3 








Tyr 


Thr 


vai 


Gin 


Tyr 


yrJ.y 


Asp 


1LLL 


Leu 


Scr 


XLLL 


± ± c 


a") .a 

/i-L ct 


Ul u 




Leu 




bU 










b b 










o U 










Gly 


vai 


Asp 


Vai 


Thr 


Vdl 


Leu 


7\ "1 a 

/i J.a 


Asn 


Leu 


Asn 


Lys 


Tl A 

J. J.C 


Tiir 


Asn 




O 3 










/ u 




















o u 


Asp 


Leu 


lie 


Phe 


Pro 


Glu 


Thr 


Vai 


Leu 


Thr 


Thr 


Thr 


Vai 


Asn 


Glu 


Ala 










85 










on 










y d 




Glu 


U1U 


Vai 


Thr 


Glu 


Vai 


Glu 


He 


Gin 


Thr 


Pro 


Gin 


Ala 


Asp 


Ser 


Ser 








100 










1 AC 
IvD 










J. X U 








o±U 


V d ± 


Thr 


Thr 


«. J. a. 


X Hi. 


Hid 


J\ on 


Leu 


Thr 


Thr 


Asn 


Gin 


Vai 


Thr 






T 1 £ 

± J.D 










ion 










± £ o 








vai 


Asp 


Asp 


Gin 


Thr 


Vfl 1 
Vdl 




VcLJ. 


7\ 1 a 
•ci-L ct 


A 

sp 


eu 




Gin 


Pro 


He 


Ala 














135 




















Glu 


Vai 


Trir 


Lys 


Thr 


Vdl 


lie 


Ala 


del 


nl n 

\j J. u 


ul U. 


Vai 


Ala 


Pro 


Ser 


Thr 


14 5 








15 0 










1 c c 










loU 


Gly 


Thr 


Ser 


Vai 


Pro 




olU 


lj J- 11 


1 XI IT 


X ILX\ 




Thr 




r 9 


Pro 


Vai 










165 










170 










17.5 




Glu 


Glu 


Ala 


Thr 


Pro 


Gin 


Glu 


Thr 


Thr 


Pro 


Ala 


Glu 


Lys 


Gin 


Glu 


Thr 








180 










185 










190 






Gin 


Ala 


Ser 


Pro- Gin 


Ala 


Ala 


Ser 


Ala- 


Vai 


Glu 


Vai 


Thr 


Thr 


■Thr 


Ser 






195 










200 










205 








Ser 


Glu 


Ala 


Lys 


Glu 


Vai 


Ala 


Ser 


Ser 


Asn 


Gly 


Ala 


Thr 


Ala 


Ala 


Vai 




210 










215 










220 










Ser 


Thr 


Tyr 


Gin 


Pro 


Glu 


Glu 


Thr 


Lys 


He 


lie 


Ser 


Thr 


Thr 


Tyr 


Glu 


225 










230 










235 










240 


Ala 


Pro 


Ala 


Ala 


Pro 


Asp 


Tyr 


Ala 


Gly 


Leu 


Ala 


Vai 


Ala 


Lys 


Ser 


Glu- 










245 










250 










255 




Asn 


Ala 


Gly 


Leu 


Gin 


Pro 


Gin 


Thr 


Ala 


Ala 


Phe 


Lys 


Glu 


Glu 


He 


Ala 








260 










265 










270 






Asn 


Leu 


Phe 


Gly He 


Thr 


Ser 


Phe 


Ser 


Gly 


Tyr 


Arg 


Pro 


Gly 


Asp 


Ser 






275 










280 










285 








Gly 


Asp 


His 


Gly Lys 


Gly 


Leu 


Ala 


lie 


Asp 


Phe 


Met 


Vai 


Pro 


Glu 


Arg 




290 










2 95 










300 










Ser 


Glu 


Leu 


Gly Asp 


Lys 


He 


Ala 


Glu 


Tyr 


Ala 


He 


Gin 


Asn 


Met 


Ala 


305 










310 










315 










320 


Ser 


Arg 


Gly 


He 


Ser 


Tyr 


He 


He' 


Trp 


Lys 


Gin 


Arg 


Phe 


Tyr 


Ala 


Pro 










325 










330 










335 




Phe 


Asp 


Ser 


Lys 


Tyr 


Gly 


Pro 


Ala 


Asn 


Thr 


Trp 


Asn 


Pro 


Met 


Pro 


Asp 








340 










345 










350 






Arg 


Gly 


Ser 


Vai 


Thr 


Glu 


Asn 


His 


Tyr 


Asp 


His 


Vai 


His 


Vai 


Ser 


Met 






355 










360 










365 
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Asn Gly 
' 370 

<210> 19 
<211> 1183 
<212> DNA 
<213> S. pyogenes 



<220> 

<221> misc_dif f erence 
<222> (428) . . . (448) 

<223> nnnnnrmnjumnnnimnnnnn can be ctgatgtccaacgacaccat 
or absent 



<221> misc_diff erence 
<222> (733) . . . (744) 

<223> nnnnnnnnnnnn can be cagatgttaact or absent 



<221> misc_diff erence • 
<222> (883) . . . (883) 
<223> n is g or absent 

<221> misc_diff erence 
<222> (943) . . . (943) 

<223> h is t or absent 

<400> 19 

atgattatta ctaaaaagag yttatttgtg acaagtgtcg ctttgtcgtt agyacctttg 60 

gcgacagcrc aggcacaaga gtggacacca cgatcggtta casaaatcaa gtctgaactc 120 

gtcctagttg ataatgtttt tacttatayw gtaaaatacg gtgacacttt aagcacaatt 180 

gctgaagcaa tgggrattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 24 0 

gacytaattt ttccagacac gatcctaaca gcmaactaca aycaacacgg tcaggcaacg 300 

amtttgacgg ttcaagcrcc tgcttctagt ccakctagcg ttagtcatgt acctagcagt 360 

gagccattac cccaagcatc tgccacctct caaycgactr ttcctatggc accayctgcg 420 

acaccatnnn nnnnnnnnnn nnnnnnnntm gcatctgcaa agccagatag ttytgtgaca 480 

gcgtcatctg agctcacatc rtcaacgaat gatgtttcga ctgagtygtc tagcgaatca 540 

caaaagcagc cagaagtacc acaagaagca gwwccaactc ctaaagcagc tgaamssact 600 

gaagtcgaae ctaagacaga catcteagar. gmyycaactt cagctaatag gcctgtacct- . 660 

aacgrragtg cttcagaaga agyttcttct gcggccccag cacaagcycc agcagaaaaa 720 

gaagaaacct ctnnnnnnnn nnnngcgcca gcagcacaaa aagctgtagc tgacaccaca 780 

agtgttgcaa cctcaaaygg cctttcttac gctccaaacc atgcctacaa tccaatgaat ' 84 0 

gcagggcttc aaccacaaac agcagccttc aaagaagaag tgncttctgc ctttggtatt 900 

acgtcattta gtggttaccg tccaggwgay ccaggagatc atnggtaaag gwttrgccat 960 

tgaytttatg gtrcckgwwa rytctrckct tggtgatcaa gttgctcaat atgccattga 1020 

ccatatggca gassgtggta tttcatacgt tatttggaaa cagcgattct atgcgccatt 1080 

tgcaagtatt tacggaccag cytacacatg gaaccccatg ccagatcgcg gcagtattac 114 0 

agwwwwccat tatgatcatg ttcatgtctc ctttaatgct taa 1183 



<210> 20 

<211> 393 

<212> PRT 

<213> s. pyogenes 

<220> 

<221> VARIANT 
<222> (18) . . . (18) 
<223> Xaa = Ala or Val 



<221> VARIANT 
<222> (50) . . . (50) 
<223> Xaa « Thr or lie 



<221> VARIANT 
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<222> (101) . - - (101), 
<22 3> Xaa = Thr or Asn 

<2 21> VARIANT 
<222> (112) . . . (112) 
<223> Xaa = Ala or Ser 

<221> VARIANT 
<222> (132) . . . (132) 
<22 3> Xaa = Pro or Ser 

<2 21> VARIANT 
<222> (134) . . . (134) 
<223> xaa = val or lie 

<221> VARIANT 
<222> (139) . . . (139) 
<22 3> Xaa = Ser or Pro 

<2 21> VARIANT 
<222> (143) . . . (149) 

<2 23> Xaa Xaa Xaa Xaa Xaa Xaa Xaa = Ser Asp Val Pro Thr 
Thr pro or absent 

<221> VARIANT 
<222> (150) . . . (150) 
<223> Xaa * Phe or Leu 

<221> VARIANT 

<222> (158) . . . (158) 

<2 23> Xaa = Ser or Phe 

<221> VARIANT 
<222> (176) . . . (176) 
<223> Xaa = Leu or Ser 

<221> VARIANT 
<222> • (191) . ; . <191> 
<223> Xaa = Val or Glu 

<221> VARIANT ' 

<222> (199) . . . (199) 

<2 23> Xaa = Thr or Pro or Ser 

<221> VARIANT 
<222> (211) . . . (211) 
<223> Xaa = Asp or Ala 

<2 21> VARIANT 
<222> (212) . . . (212) 
<223> Xaa = Pro or Ser 

<221> VARIANT 
<222> (222) . . . (222) 
<223> Xaa = Glu or Gly 

<2 21> VARIANT 
<222> (228) . . . (228) 
<223> Xaa - Val or Ala 

<221> VARIANT 
<222> (242) . . . (245) 

<2 23> Xaa Xaa Xaa Xaa = Glu thr Ser Gin or absent 
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<221> VARIANT 
<222> (246) . . . (246) 
<223> Xaa = Glu or Met 

<221> VARIANT 
<222> (247) . . . (247) 
<223> Xaa = Thr or Leu 

<221> VARIANT 
<222> (248) . . . (248) 
<223> Xaa = Ser or Thr 

<221> VARIANT 
<222> (295) . . . (295) 
<223> Xaa « Ala or Leu 

<221> VARIANT 
<222> (296) . . . (296) 
<223> Xaa = Ser or Leu 

<221> VARIANT 
<222> (297) . . . (297) 
<223> Xaa = Ala or Pro 

<221> VARIANT 
<222> (298) . . . (298) 
<223> Xaa = Phe or Leu 

<221> VARIANT 
<222> (299) . . . (299) 
<223> Xaa = Gly or Val 

<221> VARIANT 
<222> (300) . . . (300) 
<223> Xaa = He or Leu 

.. <221> VARIANT. ...... 

<222> (301) . . . (301) 
<223> Xaa = Thr or Arg 

<221> VARIANT 
<222> (302) . . . (302) 
<223> Xaa = Ser or His 

<221> VARIANT 
<222> (303) . . . (303) 
<223> Xaa = Phe or Leu 

<221> VARIANT 
<222> (304) . . . (304) 
<223> Xaa = Ser or Val 

<221> VARIANT 
<222> (305) . . . (305) 
<223> Xaa = Gly or Val 

<221> VARIANT 
<222> (306) . . . (306) 
<223> Xaa = Tyr or Thr 



<221> VARIANT 
<222> (307) . . . (307) 



< 
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<223> Xaa = Arg or Val 

<221> VARIANT 
<222> (308) . . . (308) 
<223> Xaa sa Pro or Gin 

<221> VARIANT 

<222> (309) . . . (309) 

<2 23> Xaa = Gly or Glu 

<221> VARIANT 
<222> (310) . . . (310) 
<223> Xaa = Asp or He 

<221> VARIANT 
<222> (311) . . . (311) 
<223> Xaa = Pro or Gin 



<221> VARIANT 
<222> (312) . . . (312) 
<223> Xaa = Gly or Glu 

<221> VARIANT 
<222> (313) . . . (313) 
<223> Xaa = Asn or He 



<221> VARIANT 
<222> (314) . . . (314) 
<223> Xaa = His or He 



<221> VARIANT 
<222> (326) . . . (326) 
<223> Xaa = Glu or Val 



<221> VARIANT 
<222> (327) . . . (327) 
<223> Xaa = Asn or Ser 

<221> VARIANT 
<222> (329) . . . (329) 
<223> Xaa = Ala or Th-r 



<221> VARIANT 

<222> (344) . . . (344) 

<2 23> Xaa = Glu or Asp 

<221> VARIANT 

<222> (345) . . . (345) 

<2 23> Xaa = Arg or Gly 



<400> 20 

Met He He Thr Lys Lys Ser Leu 

1 5 
Leu Xaa Pro Leu Ala Thr Ala Gin 
20 

Val Thr Glx He Lys Ser Glu Leu 

35 40 
Tyr Xaa Val Lys Tyr Gly Asp Thr 

50 55 
Gly He Asp Val His Val Leu Gly 
65 70 



Phe Val Thr Ser Val Ala Leu Ser 

10 15 
Ala Gin Glu Trp Thr Pro Arg Ser 
25 30 
Val Leu Val Asp Asn Val Phe Thr 
45 

Leu Ser Thr He Ala Glu Ala Met 
60 

Asp He Asn His He Ala Asn He 
75 80 
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Asp 


Leu 


xxe 


pne 


Pro Asp 


inr 


lie 


Leu 


inr 


Ala Asn Tyr Asn Gin His 
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Q A 






y b 






Aid 


liir 


Xaa 


Leu 


inr 


vai 


LrXn 


Aia 


Pro Ala 


Ser Ser 


Pro Xaa 
























inn 

11U 




Ser 


v ax 


Ser 


ril S 


val 


Pro 


q pr . 

□ CI 


Ser 


ul U 


Pro 


Leu Pro 


Gin Ala 


Ser Ala 






1 1 c 

llj 










ion 








125 




J- liX 


Ser 




Ya a 
Add 


inr 


Xaa 


Pro 


Met- 
ric U- 


HJ.d 




Xaa Ala 


Thr Pro 


Xaa Xaa 




IjU 










1 

J.J3 








14,0 






Add 


Add 


Add 


Add 


Ya a 
Add 


Yaa 
Add 


Ml d 


Cor 

OCX. 


M.J. d 


Lys 


Pro Asp 


ser xaa 


Val Thr 












ICO 

IbU 










155 




160 


Aid 


Oca -i- 


Ser 


bill 


Leu 


Thr 


Ser 


Ser 


inr 


Asn 


Asp Val 


Ser Thr 


Glu Xaa 










165 










1 / u 






175 


Ser 




m ii 

LrXU 


Ser 


oxn 


Lys 


bin 


Pro 


nl n 
ulu 


V ax 


Pro Gin 


Glu Ala 


Xaa Pro 








T fl o 

iCU 










1 A c 

J.OO 






190 




Thr 


Pro 


'Lys 


Pi. ±3. 


Ala 


Glu 


Ya a 
Add 


Thr 


u. 


Va 1 

v a j. 


Glu Pro Lys Thr Asp lie 






xy b 










O ft fi 

Z u u 








205 




Sex 


lj J. 11 


Aaa 


Add 


Thr 


Ser 


an a 


Asn 


Arg 


Pro 


Val Pro 


Asn Xaa 


Ser Ala 




01 ft 

A X U 










Z 1 O 








220 






Ser 




(jXU 


Xaa 


Ser 


Ser 


a 1 =» 
Ala 


Ala 


Pro 


AXa 


Gin Ala 


Pro Ala 


Glu Lys 


22b 










230 










235 




240 


Glu 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Ala 


Pro 


Ala Ala Gin Lys Ala Val 










245 










250 






255 


Ala 


Asp 


Thr 


Thr 


Ser 


Val 


Ala 


Thr 


Ser 


Asn 


Gly Leu Ser Tyr Ala Pro 








260 










2 65 






270 




Asn 


His 


Ala 


Tyr 


Asn 


Pro 


Met 


Asn 


Ala 


Gly 


Leu Gin 


Pro Gin 


Thr Ala 






A / b 


















285 




Ala 


Pne 


Lys 


Glu 


Glu 


Val 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa Xaa 


Xaa Xaa 




O Q A 

z y u 










295 








300 






Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Gly Lys 


Gly Leu Ala lie 


n n c 
o Ob 










310 










315 




320 


Asp 


Phe 


Met 


Val 


Pro 


Xaa 


Xaa 


Ser 


Xaa 


Leu 


Gly Asp Gin Val Ala Gin" 










325 










330 






335 


Tyr 


Ala 


lie 


Asp 


His 


Met 


Ala 


Xaa 


Xaa 


Gly 


lie Ser Tyr Val 


lie Trp 








340 










345 






350 




Lys 


Gin 


Arg 


Phe 


Tyr 


Ala 


Pro 


Phe 


Ala 


Ser 


lie Tyr Gly Pro Ala Tyr 






355 










360 








365 




Thr 


Trp 


Asn 


Pro 


Met 


Pro Asp 


Arg 


Gly 


Ser 


lie Thr 


Xaa Xaa 


His Tyr 




370 










-375 








' ■ ■ 380 






Asp 


His 


Val 


His 


Val 


Ser 


Phe 


Asn 


Ala 










385 










390 

















<210> 21 
<211> 32 
<212> DMA 

<213> Artificial Sequence 
<220> 

«^223> DMAR16 Oligonucleotide 
<400> 21 

caggccatgg agtggacacc acgatcggtfc ac 32 

<210> 22 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DMAR17 Oligonucleotide 
<400> 22 

gccgctcgag agcattaaag gagacatgaa catgatc 37 
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<210> 23 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Signal peptide predicted from analysis of SEQ ID 
NO: 2 

<400> 23 

Met lie lie Thr Lys Lye Ser Leu Phe Val Thr Ser Val Ala Leu Ser 

15 10 15 

Leu Ala Pro Leu Ala Thr Ala Gin Ala 
20 25 

<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> VARIANT 

<222> (3) . . . (3) 

<223> Xaa = Any Amino Acid 

<223> Cell wall anchoring motif 

<400> 24 

Leu Pro Xaa Thr Gly 
1 5 

<210> 25 
<211> 6 
<212> PRT • 

<213> Artificial Sequence 
<220> 

. <223>. IgA binding, motif 
<400> 25 

' Met Leu • Lys Lys lie Glu 
1 5 

<210> 26 
<:211> 28 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> DMAR69 oligonucleotide 
<400> 26 

ctgggaagat tatctagcac attaatac 28 

<210> 27 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> DMAR72 oligonucleotide 
<400> 27 • 
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cataacgtta aaactgtcta aaggg 25 



<210> 28 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DMAR24 oligonucleotide 
<400> 28 

tacccggatc cccaagagtg gacaccacga tcgg 

<210> 29 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



34 



<220> 

<223> DiVIAR25 oligonucleotide 
<400> 29 

gcgctcgtcg acgcgtatct cagcctctta tagggc. 36 
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What is claimed is: 

1. An isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide having a sequence 
chosen from: SEQ ID NOs : 2, 4, 6, 8, 10, 12, 14, 16, 2 0 or 
fragments, analogues or derivatives thereof. 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide having a sequence 
chosen from: SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16 or 20. 

4. A polynucleotide according to claim 3, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
t identity to the second polypeptide. 

5. An isolated polynucleotide encoding a polypeptide capable of 
generating antibodies having binding specificity for a 

... polypeptide, having, a . sequence chosen, from:. ..SEQ ID. NOs : 2, .4,. 
6, 8, 10, 12, 14, , 16, 2 0 or fragments, analogues or 
derivatives thereof. 

6. An isolated polynucleotide encoding a polypeptide capable of 
generating antibodies having binding specificity for a 
polypeptide having a sequence chosen from: SEQ ID NOs: 2, 4, 
6, 8, 10, 12, 14, 16 or 20. 

7. An isolated polynucleotide that is complementary to the 
polynucleotide of any of claims 1 to 6 . 

8. The polynucleotide of' any of claims 1 to 6 , wherein said 
polynucleotide is DNA. 
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The polynucleotide of any of claims 1 to 6, wherein said 
polynucleotide is RNA. 

10. A polynucleotide which hybridizes under stringent conditions 
to a second polynucleotide having a sequence chosen from: 
SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 19. 

11. A polynucleotide according to claim 10 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

12. A polynucleotide which hybridizes under stringent conditions 
to a second polynucleotide having a sequence chosen from: 
SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 19. 

13. A polynucleotide according to claim 12 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

14. An isolated polynucleotide having a sequence comprising SEQ 
ID NO: 19. 

15. A vector comprising the polynucleotide of any of claims 1 
to 6 or 10 to 14, wherein said DNA is operably linked to an 
expression control region. 

16. A vector comprising the polynucleotide of claim 7, wherein 
said DNA is operably linked to an expression control 
region. 

17. A host cell transfected with the vector of claim 15. 

18. A host cell transfected with the vector of claim 16. 
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19. A process for producing a polypeptide comprising culturing a 
host cell according to claim 17 under conditions suitable for 
expression of said polypeptide. 

20. A process for producing a polypeptide comprising culturing a 
host cell according to claim 18 under condition suitable for 
expression of said polypeptide. 

21. An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen from: 
SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 20 or fragments, 
analogues or derivatives thereof. 

22. An isolated polypeptide having at least 95% identity to a 
second polypeptide having an amino acid sequence -chosen from: 
SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 20 or fragments, 
analogues or derivatives thereof. 

23. An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen from: 
SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16 or 20. 

24. An isolated polypeptide having at least 95% identity to a 
second polypeptide having an amino acid sequence chosen from:' 
SEQ ID NOs: 2 , 4 , 6, 8 , 10 , 12 , 14 , 16 or 20 . 

25. An isolated polypeptide capable of generating antibodies 
having binding specificity for a second polypeptide having a 
sequence chosen from: SEQ ID NOs : 2, 4, 6, 8, 10, 12, 14, 16, 
20 or fragments, analogues or derivatives thereof. 

26. An isolated polypeptide capable of generating antibodies 
having binding specificity for a second polypeptide having a 
sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16 
or 20 . 
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27. An isolated polypeptide having an amino acid sequence chosen 
from SEQ ID NOs : 2, 4, 6, 8, 10, 12, 14, 16 , 20 or fragments, 
analogues or derivatives thereof . 

28. An isolated polypeptide having an amino acid sequence chosen 
from SEQ ID NOs : 2, 4, 6, 8, 10, 12, 14, 16 or 20. 

29. An isolated polypeptide according to any of claims 21 to 28, 
wherein the N- terminal Met residue is deleted. 

30. An isolated polypeptide according to any of claims 21 to 28, 
wherein the secretory amino acid sequence is deleted. 

31. An isolated polypeptide according to claim 29, wherein the 
secretory amino acid sequence is deleted. 

32. A vaccine composition comprising a polypeptide according to 
any one of claims 21 to 28 and a pharmaceutically acceptable 
carrier, diluent or adjuvant. 

33. A vaccine composition comprising a polypeptide according to 
. claim .29 . and. a. pharmaceutically. acceptable. . .carrier.,., diluent 

or adjuvant. 

34. A vaccine composition comprising a polypeptide according to 
claim 3 0 and a pharmaceutically acceptable carrier, diluent 
or adjuvant . 

35. A vaccine composition comprising a polypeptide according to 
claim 31 and a pharmaceutically acceptable carrier, diluent 
or ad j uvant . 

36. A method for therapeutic' or prophylactic treatment of 
pharyngitis, erysipelas and impetigo, scarlet fever, and 
invasive diseases such as bacteremia and necrotizing 
fasciitis in an individual susceptible to pharyngitis, 
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erysipelas and impetigo, scarlet fever, and invasive diseases 
such as bacteremia and necrotizing fasciitis and also toxic 
shock comprising administering to said individual a 
therapeutic or prophylactic amount of a composition according 
to claim 32 . 

37. A method for therapeutic or prophylactic treatment of 
pharyngitis, erysipelas and impetigo, scarlet fever, and 
invasive diseases such as bacteremia and necrotizing 
fasciitis in an individual susceptible to pharyngitis, 
erysipelas and impetigo, scarlet fever, and invasive diseases 
such as bacteremia and necrotizing fasciitis and also toxic 
shock comprising administering to said individual a 
therapeutic or prophylactic amount of a composition according 
to claim 33 . 

38. A method for therapeutic or prophylactic treatment of 
pharyngitis, erysipelas and impetigo, scarlet fever, and 
invasive diseases such as bacteremia and necrotizing 
fasciitis in an individual susceptible to pharyngitis, 
erysipelas and impetigo, scarlet fever, and invasive diseases 

. such . as. bacteremia . and necrotizing, fasciitis, and also toxic, 
shock comprising administering to said individual a 
therapeutic' or prophylactic amount of a composition according" 
to claim 34 . 

39. A method for therapeutic or prophylactic treatment of 
pharyngitis, erysipelas and impetigo, scarlet fever, and 
invasive diseases such as bacteremia and necrotizing 
fasciitis in an individual susceptible to pharyngitis, 
erysipelas and impetigo, scarlet fever, and invasive diseases 
such as bacteremia and necrotizing fasciitis and also toxic 
shock comprising administering to said individual a 
therapeutic or prophylactic amount of a composition according 
to claim 35. 
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40. A method for therapeutic or prophylactic treatment- of 
Streptococcus pyogenes bacterial infection in an individual 
susceptible to Streptococcus pyogenes infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 32. 

41. A method for therapeutic or prophylactic, treatment of 
Streptococcus pyogenes bacterial infection in an individual 
susceptible to Streptococcus pyogenes infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 33. 

42. A method for therapeutic or prophylactic treatment of 
Streptococcus pyogenes bacterial infection in an individual 
susceptible to Streptococcus pyogenes infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 34. 

43. A method for therapeutic or prophylactic treatment of 
Streptococcus pyogenes bacterial infection in an individual 
susceptible to Streptococcus pyogenes infection comprising 

... administering . to. said . . indiyiciual . . a . therapeutic . or 
prophylactic amount of a composition according to claim 35. 

44. Use of a vaccine composition according to claim 32 for the 
prophylactic or therapeutic treatment of Streptococcal 
infection in an individual susceptible to or infected with 
streptococcal infection comprising administering to said 
individual a prophylactic or therapeutic amount of the 
composition. 

45. Use of a vaccine composition according to claim 33 for the 
prophylactic or therapeutic treatment of Streptococcal 
infection in an individual susceptible to or infected with 
streptococcal infection comprising administering to said 
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individual a prophylactic or therapeutic amount of the 
composition. 

46. Use of a vaccine composition according to claim 34 for the 
prophylactic or therapeutic treatment of Streptococcal 
infection in an individual susceptible to or infected with 
streptococcal infection comprising administering to said 
individual a prophylactic or therapeutic amount of the 
composition. 

47. Use of a vaccine composition according to claim 35 for the 
prophylactic or therapeutic treatment of Streptococcal 
infection in an individual susceptible to or infected with 
streptococcal . infection comprising administering to said 
individual a prophylactic or therapeutic amount of the 
composition. 
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Figure 1 



1 


ATGATTATTA 


C T AAAAAG AG 


CTTATTTGTG 


ACAAGTGTCG 


CTTTGTCGTT 


AGCACCTTTG 


61 


GCGACAGCAC 


AGGCACAAGA 


GTGGACACCA 


CGATCGGTTA 


CAGAAATCAA 


GTCTGAACTC 


121 


GTCCTAGTTG 


ATAATGTTTT 


TACTTATACT 


GTAAAATACG 


GTGACACTTT 


AAGCACAATT 


181 


GCTGAAGCAA 


TGGGAATTGA 


TGTGCATGTC 


TTAG GAG ATA 


TTAATCATAT 


TGCTAATATT 


241 


GACTTAATTT 


TTCCAGACAC 


GATCCTAACA 


GCCAACTACA 


ACCAACACGG 


TCAGGCAACG 


301 


ACTTTGACGG 


TTCAAGCGCC 


TGCTTCTAGT 


CCAGCTAGCG 


TTAGTCATGT 


ACCTAGCAGT 


361 


GAGCCATTAC 


CCCAAGCATC 


TGCCACCTCT 


CAATCGACTG 


TTCCTATGGC 


ACCATCTGCG 


421 


AC AC CAT C TG 


ATGTCCCAAC 


GACACCATTC 


GCATCTGCAA 


AGCCAGATAG 


TTCTGTGACA 


481 


GCGTCATCTG 


AGCTCACATC 


GTCAACGAAT 


GATGTTTCGA 


CTGAGTTGTC 


TAGCGAATCA 


541 


CAAAAGCAGC 


CAGAAGTACC 


ACAAGAAGCA 


GTTCCAACTC 


CTAAAGCAGC 


TGAAACGACT 


6C1 


GAAGTCGAAC 


CTAAGACAGA 


CATCTCAGAG 


GATTCAACTT 


CAGCTAATAG 


GCCTGTACCT 


661 


AACGAGAGTG 


CTTCAGAAGA 


AGTTTCTTCT 


GCGGCCCCAG 


CACAAGCCCC 


AGCAGAAAAA 


721 


GAAGAAACCT 


CTGCGCCAGC 


AG C AC AAAAA 


GCTGTAGCTG 


ACACCACAAG 


TGTTGCAACC 


781 


TCAAATGGCC 


TTTCTTACGC 


TCCAAACCAT 


GCCTACAATC 


CAATGAATGC 


AGGGCTTCAA 


841 


CCACAAACAG 


CAGCCTTCAA 


AGAAGAAGTG 


GCTTCTGCCT 


T TGGTATTAC 


GTCATTTAGT 


901 


GGTTACCGTC 


CAGGTGATCC 


AGGAGATCAT 


GGTAAAGGTT 


TGGCCATTGA TTTTATGGTG 


961 


CCTGAAAATT 


CTGCTCTTGG 


TGATCAAGTT 


GCTCAATATG 


CCATTGACCA 


TATGGCAGAG 



1021 CGTGGTATTT CAT AC GTTAT TTGGAAACAG CGATTCTATG CGCCATTTGC AAGTATTTAC 
1081 GGACCAGCCT ACACATGGAA CCCCATGCCA GATCGCGGCA GT AT T AC AG A AAACCATTAT 
1141 GATCATGTTC ATGTCTCCTT TAATGCTTAA (SEQ ID N0:1) 



Figure 2 

1 MIITKKSLFV TSVALSLAPL ATAQAQEWTP RSVTEIKSEL VLVDNVFTYT VKYGDTLSTI 
61 AEAMGIDVHV LGDINHIANI DLIFPDTILT AWYNQHGQAT TLTVQAPASS PASVSHVPSS 
121 EPLPQASATS QSTVPMAPSA TPSDVPTTPF ASAKPDSSVT ASSSLTSSTN DVSTELSSES 
181 QKQPEVPQEA VPTPKAAETT EVEPKTDISE DSTSANRPVP NESASEEVSS AAPAQAPAEK 
241 EETSAPAAQK AVADTTSVAT SMGLSYAPNH AYNPMNAGJjQ PQTAAFKEEV ASAFGITSFS 
301 GYRPGDPGDH GKGLiAIDFMV PENSALGDQV AQYAIDHMAE RGISYVIWKQ RFYAPFASIY 
361 GPAYTWNPMP DRGSITENHY DHVHVSFNA* (SEQ ID NO:2) 
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Figure 3 



X 


A1GA1 1A1 1A 


C 1 AAAAAGAG 


C 1 1 Al 1 Iblb 




CTTTGTCGTT 


AGCACCTTTG 


61 


GCGACAGLGC 


AGGCACAAGA 


G I GGACACCA 


CGATCGGTUA 


CAGAAATCAA 


GTCTGAACTC 


TOT 

l<Sl 


G±v».l~ 1 AG J. ±G 


A rpA TV rn/^mrnrprp 
ATAAIGI 111 


1A1AIA 


G lAAAAlAtb 


GTGACACTTT AAGCACAATT 


181 


GCTGAAG c aa 


TGGGGATTGA 


TGTGCATGTC 


nrirp A CI/"* A /~i A T A 

1 IAGGAGA1 A 


TTAATCATAT 


TG C TAAT ATT 


241 


GACTTAATTT 


TTCCAGACAC 


GATC C T AAC A 


G CAAACT AC A 


ACCAACACGG 


TCAGGCAACG 


3 01 


ACTTTGACGG 


mm/in A rtr\Ti 

TTC AAG CA C C 


TGCTTCTAGT 


CCAI C 1 AGCG 


TTAGTCATGT 


ACCTAGCAGT 


361 


GAGCCATTAC 


CCCAAGCATC 


TGCCACCTCT 


CAACCGACTG 


TTCCTATGGC 


ACCATCTGCG 


4 21 


ACACCATCTG 


ATGTCCCAAC 


G AC AC CAT T C 


GCATCTGLAA 


A /"I /"l /"t A Z" 1 A rp A O 

AG C C AG ATAG 


TTC TGTGAC A 


4 81 


GLGTLATCTG 


AGCTCACATC 


GTCAACGAAT 


/"i A T->r«<T , rp'T'i'-'/--i A 

bAlbl I 1L.GA 


CTGAGTTGTC 


TAGCGAATCA 


5 41 


LAAAAGCAGC 


CAG AAGT AC C 


AC AAG AAG C A 


Gil LtAAL 1 G 


CTAAAGCAGC 


TG AAC CG ACT 


601 


GAAGTCGAAC 


CTAAGACAGA 


CATCTCAGAA 


GACCCAACTT 


CAGCTAATAG 


GCCTGTACCT 


661 


AACGAGAGTG 


CTTCAGAAGA 


AGCTTCTTCT 


GCGGCCCCAG 


CACAAGCTCC 


AGCAGAAAAA 


721 


GAAGAAACCT 


CTCAGATGTT 


AACTGCGCCA 


G CAG C AC AAA 


AAGCTGTAGC 


TGACACCACA 


781 


AGTGTTGCAA 


CCTCAAACGG 


CCTTTCTTAC 


GCTCCAAACC 


ATGCCTACAA 


TCCAATGAAT 


841 


GCAGGGCTTC 


AACCACAAAC 


AGCAGCCTTC 


AAAGAAG AAG 


TGGCTTCTGC 


CTTTGGTATT 


901 


ACGTCATTTA 


GTGGTTACCG 


TCCAGGAGAT 


CCAGGAGATC 


ATGGTAAAGG 


ATTAGCCATT 


961 


GACTTTATGG 


TACCGGTTAG 


CTCTACGCTT 


GGTGATCAAG 


TTGCTCAATA 


TGCCATTGAC 


1021 


CATATGGCAG 


AGCGTGGTAT 


TTCATACGTT 


ATTTGGAAAC 


AGCGATTCTA 


TGCGCCATTT 


1081 


GCAAGTATTT 


ACGGACCAGC 


CTACACATGG 


AACCCCATGC 


CAGATCGCGG 


CAGTATTACA 


1141 


GAAAACCATT 


ATGATCATGT 


TCATGTCTCC 


TTTAATGCTT 


AA (SEQ ID 


NO:3) 



Figure 4 

• 1 . MIITKKSLFV TSVALSLAPL ATAQAQEWTP RSVTEIKSEL VLVDNVFTYI . VKYGDTLSTI 
61 AEAMGIDVHV LGDINHIANI DLIFPDTILT ANYNQHGQAT TLTVQAPASS PSSVSHVPSS 
121 EPLPQASATS QPTVPMAPSA TPSDVPTTPF ASAKPDSSVT ASSELTSSTN DVSTELSSES 
181 QKQPEVPQEA VPTPKAAEPT EVEPKTDISE DPTSANRPVP NESASEEASS AAPAQAPAEK 
241 EETSQMLTAP AAQKAVADTT SVATSNGLSY APNHAYNPMN AGLQPQTAAF KEEVASAFGI 
3 01 TSFSGYRPGD PGDEGKGLAI DFMVPVSSTL GDQVAQYAID HMAERGISYV IWKQRFYAPF. 
361 ASIYGPAYTW NPMPDRGSIT ENHYDHVHVS FNA* (SEQ ID NO: 4) 



2/15 



WO 02/04495 



CA 02413576 2002-12-20 



PCT/CA01/01001 



Figure 5 



1 


ATGATTATTA 


CTAAAAAGAG 


CTTATTTGTG 


ACAAGTGTCG 


CTTTGTCG TT 


AG TAC C T TTG 


61 


GCGACAGCGC 


AGG C AC AAG A 


GTGGACACCA 


CGATCGGTTA 


/"1T» /""ITV TV TV m/"l 7A 7A 

CAGAAATCAA 


GTCTGAACTC 


121 


GTCCTAGTTG 


ATAATGTTTT 


TAC TT AT ACT 


GTAAAATACG 


GTGACACTTT 


TV TV pipt7V pi 7A tv rpm 
AALjLACJAAI 1 


181 


GCTGAAGCAA 


m<^T r-% /~i r~i tv mm/— i tv 

TGGGGATTGA 


TGTGCATGTC 


1 1 wi n\ /*i/~i tv /"i 7v m TV 

TTAGGAGATA 


TTAATCATAT 


rppipirnTV 71 rp tv mm 

TGC 1 AATAT 1 


241 


GACCTAATTT 


TT C C AG AC AC 


GATCCTAACA 


/-I pi TV 71 7V /"im Tt pi TV 

GCAAACTACA 


TV rp pi 71 T/^7\ (~\r*C* 

ATCAACACGG 


rp pi 71 pi pi pi tv tv pi pi 

i CAGGC.AACG 


301 


AATTTGACGG 


TTCAAGCACC 


rppipirpmp > trTt TV pi rp 

TGCTTCTAGT 


P*pi7\ pt P<HP7\ pt p«pl 

CCAGCTAGC G 


TTAGTCATGT 


7V /^/*"T 1 7\/^pi 7\p«rp 


361 


GAGCCATTAC 


CCC AAG CATC 


TGCCACCTCT 


C AAC C GAC TG 


m rp pi pi rn 7. rppi pt r\ 


7\ pi/"i7\ /'"'/' K, T/ r ''/T' 

AUt-AUV- 1 LjUIj 


421 


ACACCATCTG 


ATGTCCCAAC 


GACACCATTC 


GCATCTGCAA 


AG CCAG AT AG 


rprppirnpii-ppi tv p»TV 

T T CTCjT GAC A 


481 


GCGTCATCTG 


AGCTCACATC 


GTCAACGAAT 


GATGTTTCGA 


CTGAGTTGTC 


TAGCGAATCA 


541 


CAAAAGCAGC 


CAGAAGTACC 


AC AAG AAG C A 


GTTCCAACTC 


CTAAAGCAGC 


TGAAACGACT 


601 


GAAGTCGAAC 


CTAAGACAGA 


CATC TCAGAA 


GCCCCAACTT 


CAGCTAATAG 


GCCTGTACCT 


661 


AACGAGAGTG 


CTTCAGAAGA 


AGTTTCTTCT 


GCGGCCCCAG 


CACAAGCCCC 


AG CAG AAAAA 


721 


GAAGAAACCT 


CTGCGCCAGC 


AG C AC AAAAA 


GCTGTAGCTG 


AC AC CACAAG 


TGTTGCAACC 


781 


TCAAATGGCC 


TTTCTTACGC 


TCCAAACCAT 


GCCTACAATC 


CAATGAATGC 


AGGGCTTCAA 


841 


CCACAAACAG 


CAGCCTTCAA 


AGAAGAAGTG 


GCTTCTGCCT 


TTGGTATTAC 


GTCATTTAGT 


901 


GGTTACCGTC 


CAGGTGATCC 


AGGAGATCAT 


GGTAAAGGTT 


TGGCCATTGA 


TTTTATGGTG 


961 


CCTGAAAATT 


CTGCTCTTGG 


TGATCAAGTT 


GCTCAATATG 


CCATTGACCA 


TATGGCAGAG 


1021 


CGTGGTATTT 


CATACGTTAT 


TTGGAAACAG 


CGATTCTATG 


CGCCATTTGC 


AAGTATTTAC 


1081 


GGACCAGCCT 


ACACATGGAA 


CCCCATGCCA 


GATCGCGGCA 


GTATTACAGA 


AAAC CAT TAT 


1141 


GATCATGTTC 


ATGTCTCCTT 


TAATGCT TAA 


(SEQ ID NO: 5) 





1 MIITKKSLFV TSVALSLVPL ATAQAQEWTP 
61 AEAMGIDVHV LGDINHIANI DLIFPDTILT 
121 EPLPQASATS QPTVPMAPPA TPSDVPTTPF 
181 QKQPEVPQEA VPTPKAAETT EVEPKTDISE 
241 EETSAPAAQK AVADTTSVAT SNGLSYAPNH 
3 01 GYRPGDPGDH GKGLAIDFMV PENSALGDQV 
361 GPAYTWNPMP DRGS ITENHY DHVHVSFNA* 



Figure 6 

RSVTEIKSEL VLVDNVFTYT VKYGDTLSTI 
ANYNQHGQAT NLTVQAPASS PASVSHVPSS 
ASAKPDSSVT ASSELTSSTM DVSTELSSES 
APTSANRPVP NESASEEVSS AAPAQAPAEK 
AYNPMNAGLQ PQTAAFKEEV ASAFGITSFS 
AQYAIDHMAE RGISYVIWKQ RFYAPFASIY 
(SEQ ID NO: 6) 
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Figure 7 



1 


ATGATTATTA 


C T AAAAAG AG 


CTTATTTGTG 


ACAAGTGTCG 


CTTTGTCGTT 


AGCACCTTTG 


61 


ppnapapppp 


a ppp ap a ana 


GTGGAPAPPA 


CGATCGGTTA 


CAGAAATCAA 


GTCTGAACTC 


191 
Xo X 


\JX t-l— InUJL lu 


ninALul 111 


TaPTTATAPa 


G^AAAATAPG 


GTGACACTTT 


AAGCACAATT 




P P TPa a HP a a 


TnpppaTTpa 


TPTPPATPTP 


TTAGGAGATA 


TTAATCATAT 


TG C T AATATT 


o a n 


GAL. 1 LAAx X 1 


X 1 LUAuALAt. 


P7ATPP r ra apa 


ppaaaPTapa 


ATPAAPAPGG 


TPAGGPAAPG 


301 


ACTxTGACGG 


men/"' TV TV r*Tl 7\ 

TTCAAGCACC 


1 CjU X X L. J.Ab 1 




TT A PTP a TPT 1 


apPTappaPT 


361 


G AG CCATTAC 


CCCAAGCATC 


1 GL.L.AL,L, 1 L. X 


r*7A Rpr*r»7\PTP 
v,AA^v*uAL 1 ij 


TTPPTaTPPP 


appaTPTPPP 


421 


ACACCATTAG 


CATCTGCAAA 


G CCAGAx. AG x 


1 L. 1 va J. bALAb 


L. \j 1 L. A X v. 1 Vj A 


ppTpapaTPP 


481 


TCAACGAATG 


ATGTTTCGAC 


ICjALrlLtjILl 


AvjrUoAAX LAL 


a aaanpappp 

AAAH LtL. Avj L 


apaaPTappa 


541 


CAAG AAG C AG 


TTCCAACTCC 


TAAAGCAGL 1 




a aPTPnaapp 


Taapapapar 


601 


ATCTCAGAAG 


ACCCAACTTC 


AGCTAATAGG 


CCTGTACCTA 


ACGAGAGTGC 


TTCAGAAGAA 


661 


GTTTCTTCTG 


CGGCCCCAGC 


ACAAGCCCCA 


GCAGAAAAAG 


AAGAAACCTC 


TGCGCCAGCA 


721 


G C AC AAAAAG 


CTGTAGCTGA 


CACCACAAGT 


GTTGCAACCT 


CAAACGGCCT 


TTCTTACGCT 


781 


CCAAACCATG 


CCTACAATCC 


AATGAATGCA 


GGGCTTCAAC 


CACAAACAGC 


AGCCTTCAAA 


841 


GAAGAAGTGG 


CTTCTGCCTT 


TGGTATTACG 


TCATTTAGTG 


GTTACCGTCC 


AGGTGACCCA 


901 


GGAGATCATG 


GTAAAGGTTT 


GGCCATTGAT 


TTTATGGTGC 


CTGAAAATTC 


TGCTCTTGGT 


961 


GATCAAGTTG 


CTCAATATGC 


CATTGACCAT 


ATGGCAGAGC 


GTGGTATTTC 


ATACGTTATT 


1021 


TGGAAACAGC 


GATTCTATGC 


GCCATTTGCA 


AGTATTTACG 


GACCAGCTTA 


CACATGGAAC 


1081 


CCCATGCCAG 


ATCGCGGCAG 


TATTACAGAA 


AACCATTATG 


ATCATGTTCA 


TGTCTCCTTT 



1141 AATGCTTAA (SEQ ID NO: 7) 

Figure 8 

1 MIITKKSLFV TSVALSLAPL ATAQAQEWTP RSVTEIKSEIi VLVD1WFTYT VKYGDTLSTT 
61 AEAMGIDVHV LGDINHIANI DLiIFPDTIIjT ANYNQHGQAT TLTVQAPASS PASVSHVPSS 
121 EPLPQASATS QPTVPMAPSA TPLASAKPDS SVTASSELTS STNDVSTESS SESQKQPEVP 
181 QEAVPTPKAA ETTEVEPKTD ISEDPTSANR PVPNESASEE VSSAAPAQAP AEKEETSAPA 
241 AQKAVADTTS VATSNGLSYA PNHAYNPMNA GLQPQTAAFK EEVASAFGIT SFSGYRPGDP 
301 GDHGKGLAID FMVPENSALG DQVAQYAIDH MAERGISYVI WKQRFYAPFA SIYGPAYTWN 
361 PMPDRGSITE NHYDHVHVSF MA* {SEQ ID MO: 8) 
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Figure 9 



1 


CAAGAGTGGA 


CACCACGATC 


GGTTACAGAA 


ATCAAGTCTG 


AACTCGTCCT 


AGTTGATAAT 


SI 


GTTTTTACTT 


ATACTGTAAA 


ATACGGTGAC 


ACTTTAAGCA 


CAATTGCTGA 


AGCAATGGGA 


12 1 


ATTGATGTGC * 


ATGTCTTAGG 


AGATATTAAT 


CATATTGCTA ATATTGACTT 


AATTTTTCCA 


181 


GACACGATCC 


TAACAGCCAA 


CTACAACCAA 


CACGGTCAGG 


CAACGACTTT 


GACGGTTCAA 


241 


GCGCCTGCTT 


CTAGTCCAGC 


TAGCGTTAGT 


CATGTACCTA 


GCAGTGAGCC 


ATTACCCCAA 


301 


GCATCTGCCA 


CCTCTCAATC 


GACTGTTCCT 


ATGGCACCAT 


CTGCGACACC 


ATCTGATGTC 


361 


CCAACGACAC 


CATTCGCATC 


TGCAAAGCCA 


GATAGTTCTG 


TGACAGCGTC 


ATCTGAGCTC 


t: j£ -L 


ACATCnTPAA 


CG AATGATGT 


TTCGACTGAG 


TTGTCTAGCG 


AATCACAAAA 


GCAGCCAGAA 


481 


fiTATCAPAAG 


AAGCAGTTCC 


AACTCCTAAA 


GCAGCTGAAA 


CGACTGAAGT 


CGAACCTAAG 


54 1 


AC AGAC ATC T 


CAGAGGATTC 


AACTTCAGCT 


AATAGGCCTG 


TACCTAACGA 


GAGTGCTTCA 


601 


GAAGAAGTTT 


CTTCTGCGGC 


/— 1/— 1 /~1 Tl /""I 71 /"I TV TV 

CCCAGCACAA. 


GCCCCAGCAG 


AAAAAGAAGA 




661 


CCAGCAGCAC 


AAAAAGCTGT 


AGCTGACACC 


ACAAGTGTTG 


CAACCTCAAA 


TGGCCTTTCT 


721 


TACGCTCCAA 


ACCATGCCTA 


CAATCCAATG 


AATGCAGGGC 


TTCAACCACA 


AACAGCAGCC 


781 


TTCAAAGAAG 


AAGTGGCTTC 


TGCCTTTGGT 


ATTACGTCAT 


TTAGTGGTTA 


CCGTCCAGGT 


841 


GATCCAGGAG 


ATCATGGTAA 


AGGTTTGGCC 


ATTGATTTTA 


TGGTGCCTGA 


AAATTCTGCT 


901 


CTTGGTGATC 


AAGTTGCTCA 


ATATGCCATT 


GACCATATGG 


CAGAGCGTGG 


TATTTCATAC 


961 


GTTATTTGGA 


AACAGCGATT 


CTATGCGCCA 


TTTGCAAGTA 


TTTACGGACC 


AGCCTACACA 


1021 


TGGAACCCCA 


TGCCAGATCG 


CGGCAGTATT 


ACAGAAAACC 


ATTATGATCA 


TGTTCATGTC 


1081 


TCCTTTAATG 


CTTAA (SBQ 


ID NO; 9) 









Figure 10 

1 QEWTPRS.VTE IKSELVLVDN VFTYTVKYGD TLSTIAEAMG IDVHVLGDIN HIANIDLIFP 
61 DTILTANYNQ HGQATTLTVQ APASSPASVS HVPSSEPLPQ ASATSQSTVP MAPSATPSDV 
121 PTTPFASAKP DSSVTASSEL TSSTNDVSTE LSSESQKQPE . VP.QEAVPTPK AAETTEVEPK 
181 TDISEDSTSA NRPVPNESAS EEVSSAAPAQ APAEKEETSA PAAQKAVADT TSVATSNGLS 
241 YAPNHAYNPM NAGLQPQTAA FKEEVASAFG ITSFSGYRPG DPGDHGKGLA IDFMVPENSA 
3 01 LGDQVAQYAI DHMAERGISY VIWKQRFYAP FASIYGPAYT WNPMPDRGSI TENHYDHVHV 
3 61 SFNA* (SEQ ID NO: 10) 
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Figure 11 

1 CAAGAGTGGA CACCACGATC GGTTACAGAA ATCAAGTCTG AACTCGTCCT AGTTGATAAT 

61 GTTTTTACTT ATATAGTAAA ATACGGTGAC ACTTTAAGCA CAATTGCTGA AGCAATGGGG 

121 ATTGATGTGC ATGTCTTAGG AGATATTAAT CATATTGCTA ATATTGACTT AATTTTTCCA 

181 GACACGATCC TAACAGCAAA CTACAACCAA CACGGTCAGG CAACGACTTT GACGGTTCAA 

241 GCACCTGCTT CTAGTCCATC TAGCGTTAGT CATGTACCTA GCAGTGAGCC ATTACCCCAA 

301 GCATCTGCCA CCTCTCAACC GACTGTTCCT ATGGCACCAT CTGCGACACC ATCTGATGTC 

361 CCAACGACAC CATTCGCATC TGCAAAGCCA GATAGTTCTG TGACAGCGTC ATCTGAGCTC 

421 ACATCGTCAA CGAATGATGT TTCGACTGAG TTGTCTAGCG AATCACAAAA GCAGCCAGAA 

481 GTACCACAAG AAGCAGTTCC AACTCCTAAA GCAGCTGAAC CGACTGAAGT CGAACCTAAG 

541 ACAGACATCT CAGAAGACCC AACTTCAGCT AATAGGCCTG ACCTAACGA GAGTGCTTCA 

601 GAAGAAGCTT CTTCTGCGGC CCCAGCACAA GCTCCAGCAG . AAAAAGAAGA AACCTCTCAG 

661 ATGTTAACTG CGCCAGCAGC ACAAAAAGCT GTAGCTGACA CCACAAGTGT TGCAACCTCA 

721 AACGGCCTTT CTTACGCTCC AAACCATGCC TACAATCCAA TGAATGCAGG GCTTCAACCA 

781 CAAACAGCAG CCTTCAAAGA AGAAGTGGCT TCTGCCTTTG GTATTACGTC ATTTAGTGGT 

841 TACCGTCCAG GAGATCCAGG AGATCATGGT AAAGGATTAG CCATTGACTT TATGGTACCG 

901 GTTAGCTCTA CGCTTGGTGA TCAAGTTGCT CAATATGCCA TTGACCATAT GGCAGAGCGT 

961 GGTATTTCAT ACGTTATTTG GAAACAGCGA TTCTATGCGC CATTTGCAAG TATTTAC GG A 

1021 CCAGCCTACA CATGGAACCC CATGCCAGAT CGCGGCAGTA TTACAGAAAA C C ATT AT GAT 

1081 CATGTTCATG TCTCCTTTAA TGCTTAA {SEQ ID NO: 11) 



Figure 12 

1' QEWTPRSVTE' IKSELVLVDN VFTYIVKYGD TLSTIAEAMG IDVHVLGDIN HIANIDLIFP ' 
61 DTILTANYNQ HGQATTLTVQ APASSPSSVS HVPSSEPLPQ ASATSQPTVP MAPSATPSDV 
121 PTTPPASAKP DSSVTASSEL TSSTNDVSTE LSSESQKQPE VPQSAVPTPK AAEPTEVEPK 
181 TDISEDPTSA NRPVPNESAS EEASSAAPAQ APAEKEETSQ MLTAPAAQKA VADTTSVATS 
241 NGLSYAPNHA YNPMNAGLQP QTAAFKEEVA SAFGITSFSG YRPGDPGDHG KGLAIDFMVP 
301 VSSTLGDQVA QYAIDHMAER GTSYVIWKQR FYAPFASIYG FAYTWNPMPD RGSITEWHYD 
361 HVHVSFNA* (SEQ ID NO: 12) 
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Figure 13 



-1 
J_ 


P2i 7v P.AP,TP.P,A 


CACCACGATC 


GGTTACAGAA 


ATPAAPTPTP* 


AAPTPGTPCT 


AGTTGATAAT 

n\j X iUninnl 


D X, 


Ulllll rlL- J. J. 


ATACTGTAAA ATACGGTGAC 


A PTTT A AGP A 


PAATTGCTGA 


AGCAATGGGG 


121 




ATGTCTTAGG 


AGATATTAAT 


CATATTGCTA 


ATATTGACCT 


AATTTTTCCA 


i-O _L 


hapapratpp 


TAACAGCAAA 


CTACAATCAA 


CACGGTCAGG 


CAACGAATTT 


GACGGTTCAA 


241 




CTAGTCCAGC 


TAGCGTTAGT 


CATGTACCTA 


GCAGTGAGCC 


ATTACCCCAA 




p, r» a t PTP, PP A 


CCTCTCAACC 


GACTGTTCCT 


ATGGCACCAC 


CTGCGACACC 


ATCTGATGTC 




PPA APPAPAP 


CATTCGCATC 


TGCAAAGCCA 


GATAGTTCTG 


TGACAGCGTC 


ATCTGAGCTC 


ATT 
ft zl J. 


A P A TfRTH A A 


CGAATGATGT 


TTCGACTGAG 


TTGTCTAGCG 


AATCACAAAA 


GCAGCCAGAA 


ART 

O J. 


PTAPPAPA AP 


AAGCAGTTCC 


AACTCCTAAA 


GCAGCTGAAA 


CGACTGAAGT 


CGAACCTAAG 


541 


ACAGACATCT 


CAGAAGCCCC 


AACTTCAGCT 


AATAGGCCTG 


TACCTAACGA 


GAGTGCT1 CA 


601 


GAAGAAGTTT 


CTTCTGCGGC 


CCCAGCACAA 


GCCCCAGCAG 


AAAAAGAAGA 


AACCTCTGCG 


661 


CCAGCAGCAC 


AAAAAGCTGT 


AGCTGACACC 


ACAAGTGTTG CAACCTCAAA 


TGGCCTTTCT 


721 


TACGCTCCAA 


ACCATGCCTA 


CAATCCAATG 


AATGCAGGGC 


TTCAACCACA 


AACAGCAGCC 


781 


TTCAAAGAAG 


AAGTGGCTTC 


TGCCTTTGGT 


ATTACGTCAT TTAGTGGTTA 


CCGTCCAGGT 


841 


GATCCAGGAG 


ATCATGGTAA AGGTTTGGCC 


ATTGATTTTA 


TGGTGCCTGA 


AAATTCTGCT 


901 


CTTGGTGATC 


AAGTTGCTCA 


ATATGCCATT 


GACCATATGG 


CAGAGCGTGG 


TATTTCATAC 


961 


GTTATTTGGA 


AACAGCGATT 


CTATGCGCCA 


TTTGCAAGTA 


TTTACGGACC 


AGCCTACACA 


1021 


TGGAACCCCA 


TGCCAGATCG 


CGGCAGTATT 


ACAGAAAACC 


ATTATGATCA 


TGTTCATGTC 


1081 


TCCTTTAATG 


CTTAA ( SEQ 


ID NO:13) 









. Figure 14 

1 QEWTPRSVTE IKSELVLVDN VFXYTVKYGD TLSTIAEAMG IDVHVLGDIN HIANIDLIFP 

61 DTILTANYNQ . HGQATNLTYQ APASSPASVS HVPSSEPLPQ . ASATSQPTVP MAPPATPSDV 

121 PTTPFASAKP DSSVTASSEL TSSTNDVSTE LSSESQKQPE VPQEAVPTPK AAETTEVEPK 

181 TDISEAPTSA NRPVPNESAS EEVSSAAPAQ APAEKEETSA PAAQKAVADT TSVATSNGLS 

241 YAPNHAYNPM NAGLQPQTAA FKEEVASAFG ITSFSGYRPG DPGDHGKGLA IDFMVPENSA 

301 LGDQVAQYAI DHMAERGISY VIWKQRFYAP FASIYGPAYT WNPMPDRGSI TENHYDHVHV 
361 SFNA* (SEQ ID NO:14) 
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Figure 15 



1 


CAAGAGTGGA 


CACCACGATC 


GGTTACAGAA 


ATCAAGTCTG 


AACTCGTCCT 


AGTTGATAAT 


61 


GTTTTTACTT 


ATACAGTAAA 


ATACGGTGAC 


ACTTTAAGCA 


CAATTGCTGA 


AGCAATGGGG 


121 


ATTGATGTGC 


ATGTCTTAGG 


AGATATTAAT 


CATATTGCTA 


ATATTGACTT 


AATTTTTCCA 


181 


GACACGATCC 


TAACAGCAAA 


CTACAATCAA 


CACGGTCAGG 


CAACGACTTT 


GACGGTTCAA 


241 


GCACCTGCTT 


CTAGTCCAGC 


TAGCGTTAGT 


CATGTACCTA 


GCAGTGAGCC 


ATTACCCCAA 


301 


GCATCTGCCA 


CCTCTCAACC 


GACTGTTCCT 


ATGGCACCAT 


CTGCGACACC 


ATTAGCATCT 


361 


GCAAAGCCAG 


ATAGTTCTGT 


GACAGCGTCA 


TCTGAGCTCA 


CATCGTCAAC 


GAATGATGTT 


421 






A.TCACAAAAG 


CAGCCAGAAG 


TACCACAAGA 


AGCAGTTCCA 


to J. 


APTPPTAAAf; 


PAfJPTfiAAAP 


OAPTGAAGTP 


GAAPCTAAGA 


CAGACATCTC 


AGAAGACCCA 


541 


ACTTCAGCTA 


ATAGGCCTGT 


ACCTAACGAG 


AGTGCTTCAG 


AAGAAGTTTC 


TTCTGCGGCC 


601 


CCAGCACAAG 


CCCCAGCAGA 


AAAAGAAGAA 


ACCTCTGCGC 


CAGCAGCACA 


AAAAGCTGTA 


661 


GCTGACACCA 


CAAGTGTTGC 


AACCTCAAAC 


GGCCTTTCTT 


ACGCTCCAAA 


CCATGCCTAC 


721 


AATCCAATGA 


ATGCAGGGCT 


TCAACCACAA 


ACAGCAGCCT 


TCAAAGAAGA 


AGTGGCTTCT 


781 


GCCTTTGGTA 


TTACGTCATT 


TAGTGGTTAC 


CGTCCAGGTG 


ACCCAGGAGA 


TCATGGTAAA 


841 


GGTTTGGCCA 


TTGATTTTAT 


GGTGCCTGAA 


AATTCTGCTC 


TTGGTGATCA 


AGTTGCTCAA 


901 


TATGCCATTG 


ACCATATGGC 


AGAGCGTGGT 


ATTTCATACG 


TTATTTGGAA 


ACAGCGATTC 


961 


TATGCGCCAT 


TTGCAAGTAT 


TTACGGACCA 


GCTTACACAT 


GGAACCCCAT 


GCCAGATCGC 


1021 


GGCAGTATTA 


CAGAAAACCA 


TTATGATCAT 


GTTCATGTCT 


CCTTTAATGC 


TTAA (SEQ ID 



NO:15) 

Figure 16 

1 QEWTPRSVTE IKSELVLVDN VFTYTVKYGD TLSTIAEAMG IDVHVLGDIN HIANIDLIPP 
61 DTILTANYNQ HGQATTLTVQ APASSPASVS HVPSSEPIjPQ ASATSQPTVP MAPSATPLAS 
121 .AKPDSSVTAS SELTSSTNDV STESSSESQK QPEVPQEAVP TPKAAETTEV EPKTDISEDP 
181 TSANRPVPNE SASEEVSSAA PAQAPAEKEE TSAPAAQKAV ADTTSVATSN GLSYAPNHAY 
241 NPMNAGLQPQ TAAFKEEVAS AFGITSFSGY RPGDPGDHGK GLAIDFMVPE NSALGDQVAQ 
301 YAIDHMAERG ISYVIWKQRF YAPFAS I YGP AYTWNPMPDR GSITENHYDE VHVSFNA* 
(SEQ ID NO: 16) 
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Figure 17 



12384 


1 


ATGATTATTACTAAAAAGAGCTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 


2699 


1 


ATGATTATTACTAAAAAGAGCTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 


B514 


1 


ATGATTATTACTAAAAAGAGCTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 


SpyS7 


1 


ATGATTATTACTAAAAAGAGCTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 


U09352 


1 


ATGATTATTACTAAAAAGAGTTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 


Oklahoma 


1 


ATGATTATTACTAAAAAGAGCTTATTTGTGACAAGTGTCGCTTTGTCGTT 


50 






******************** ***************************** 




12384 


51 


AGCACCTTTGGCGACAGCACAGGCACAAGAGTGGACACCACGATCGGTTA 


100 


2699 


51 


AGCACCTTTGGCGACAGCGCAGGCACAAGAGTGGACACCACGATCGGTTA 


100 


B514 


51 


AGCACCTTTGGCGACAGCGCAGGCACAAGAGTGGACACCACGATCGGTTA 


100 


Spy 5 7 


51 


AGTACCTTTGGCGACAGCGCAGGCACAAGAGTGGACACCACGATCGGTTA 


100 


U09352 


51 


AGCACCTTTGGCGACAGCGCAGGCACAAGAGTGGACACCACGATCGGTTA 


100 


Oklahoma 


51 


AGTACCTTTGGCGACAGCGCAGGCACAAGAGTGGACACCACGATCGGTrA 


100 






** *************** ******************************* 




12384 


101 


CAGAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATACT 


150 


2699 


101 


CAGAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATATA 


150 


B514 


101 


CAGAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATACA 


150 


Spy 5 7 


101 


CAGAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATACT 


150 


U09352 


101 


CACAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATACA 


150 


Oklahoma 


101 


CAGAAATCAAGTCTGAACTCGTCCTAGTTGATAATGTTTTTACTTATACT 


150 






** ********************************************* 




123 84 


151 


GTAAAATACGGTGACACTTTAAGCACAATTGCTGAAGCAATGGGAATTGA 


200 


2699 


151 


GTAAAATACGGTGACACTTTAAGCACAATTGCTGAAGCAATGGGGATTGA 


200 


B514 


151 


GTAAAAT AC GGT G AC ACTTTAAGC AC AATTGCTG AAG C AATGG G G ATTGA 


200 


Spy5 7 


151 


GTAAAATACGGTGACACTTTAAGCACAATTGCTGAAGCAATGGGGATTGA 


200 


U09352 


151 


GTAAAATACGGTGACACTTTAAGCACAATTGCTGAAGCAATGGGGATTGA 


200 


Die! ^VinmP5 


• 151 


G T AAAATACGGTG AC A CTTTAAG C AC AATTGC T G AAGCAATGGGGATTGA 


200 






******************************************** ***** 




12384 


201 


TGTGCATGTCTTAGGAGATATTAATCATATTGCTAATATTGACTTAATTT 


250 


2699 


201 


T GTG C ATGTCT TAGGAGATATTAATCAT ATTGC TAAT ATTGACTTAATTT 


250 


B514 


201 


T GTG C ATGTCT T AGG AG ATATTAATC AT ATTG C TAATATTG ACT TAATTT 


250 


Spy57 


• 201. 


T GTG C ATGTCT TAGGAGATATTAATCAT ATTG C TAATATTGAC C TAATTT • 


. 250 


U09352 


201 


T GTG CATGTCT TAGGAGATAT TAATC ATATTG C TAATATTGACT TAATTT 


250 


Oklahoma. 


201 


T GTG CATGTCT TAGGAGATATTAATCAT ATTG C TAAT ATTGAC C TAATTT 


250 






******************************************* ******. 




12384 


251 


T TC CAGAC ACGATCCTAACAGC CAACTAC AAC CAAC ACGGTCAGG CAACG 


300 


2699 


251 


T TC CAGAC ACGATCCTAAC AG CAAACTAC AAC CAAC ACGGTCAGG CAACG 


300 


B514 


251 


TTCCAGACACGATCCTAACAGCAAACTACAATCAAC ACGGTCAGG CAACG 


300 


Spy 5 7 


OCT 

2bl 


1 1 LLAkALALCjAI LL. TAALACjCAAAC 1 ACAA JL CAALALvjCj 1 CACjVjLAALCj 


i r\ n 


U09352 


251 


T TC CAGAC ACGATCCTAAC AGCAAACTAC AAC CAAC ACGGTCAG G CAACG 


300 


Oklahoma 


251 


T TC CAGAC ACGATCCTAAC AGCAAACTACAATCAACACGGT CAGGC AACG 


300 






********************** ******** ****************** 




12384 


301 


ACTTTGACGGTTCAAGCGCCTGCTTCTAGTCCAGCTAGCGTTAGTCATGT 


350 


2699 


301 


ACTTTGACGGTTCAAGCACCTGCTTCTAGTCCATCTAGCGTTAGTCATGT 


350 


3514 


301 


ACTTTGACGGTTCAAGCACCTGCTTCTAGTCCAGCTAGCGTTAGTCATGT 


350 


Spy 5 7 


301 


AATTTGACGGTTCAAGCACCTGCTTCTAGTCCAGCTAGCGTTAGTCATGT 


350 


U09352 


301 


ACTTTGACGGTTCAAGCGCCTGCTTCTAGTCCAGCTAGCGTTAGTCATGT 


350 


Oklahoma 


301 


AATTTGACGGTTCAAGCACCTGCTTCTAGTCCAGCTAGCGTTAGTCATGT 


350 






* *************** *************** **************** 
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T 1 "> O A 


ibl 


7\ r i r tr T7\rir i ari , T , ^'ar2r , r , 7i , T r rnr , pr , r , 7\ a r^PA T , r ,r rnr i P Anr'TPTPA. ziT'PP.A.pnpfi 


400 


2 699 


351 


a A/im 7v r< p a prpp ts. rt pATi mrp a onnnji 7\ r* a tptp ppa. ppt'PT'Paa npftApT'P. 


400 


B514 


351 


ACCTAGCAGTGAGCCATTACCCCAAGCATCTGCCACCTCTCAACCGACTG 




Spy 5 7 


351 


AC CTAGCAGTG AGC C ATTAC C C C AAG CAT CTGC CAC C T CT CAAC C G ACTG 


a nn 
ft u u 


t T A ft ft r- o 

U09352 


351 


ACCTAGCAGTGAGCCATTACCCCAAGCATCTGCCACCTCTCAATCGACTA 


a on 
ft uu 


Oklahoma 


351 


ACCTAGCAGTGAGCCATTACCCCAAGCATCTGCCACCTCTCAACCGACTG 


400 






******************************************* ***** 




12 384 


401 


TTCCTATOGCACCATCTGCGACACCATCTGATGTCCCAACGACACCATTC 


ft 3 U 


2699 


401 


TTCCTATGGCACCATCTGCGACACCATCTGATGTCCCAACGACACCATTC 


a ^ n 

ft o U 


B5 14 


401 


TTCCTATGGCACCATCTGCGACACCAT TA 


429 


Spy 5 7 


4 01 TTCCTATGGCACCACCTGCGACACCATCTGATGTCCCAACGACACCATTC 


450 


TT A ft "1 C ft 

U09352 


401 


TTCCTATGGCACCATCTGCGACACCATCTGATGTCCCAACGACACCATTA 


450 


Oklahoma 


4 01 


TTCCTATGGCACCACCTGCGACACCATCTGATGTCCCAACGACACCATTC 


450 






************** ************ * 




12384 


451 


G C ATC TG C AAAGC C AGATAGTTCTGT GAC AGCGTCAT CTG AGC TC AC ATC 


o u u 


2699 


451 


GCATCTGCAAAGCCAGATAGTTCTGTGACAGCGTCATCTGAGCTCACATC 


jUU 


B514 


430 


GCATCTGCAAAGCCAGATAGTTCTGTGACAGCGTCATCTGAGCTCACATC 


A 1 CJ 
f! / J 


Spy 5 7 


451 


GCATCTGCAAAGCCAGATAGTTCTGTGACAGCGTCATCTGAGCTCACATC 


cnn 

buu 


U09352 


451 


GCATCTGCAAAGCCAGATAGTTTTGTGACAGCGTCATCTGAGCTCACATC 


cnn 
500 


Oklahoma 


451 


GCATCTGCAAAGCCAGATAGTTCTGTGACAGCGTCATCTGAGCTCACATC 


500 






********************** *************************** 




12384 


501 


GTCAACGAATGATGTTTCGACTGAGTTGTCTAGCGAATCACAAAAGCAGC 


550 


2699 


501 


GTCAACGAATGATGTTTCGACTGAGTTGTCTAGCGAATCACAAAAGCAGC 


c « n 
5oU 


B514 


480 


GTCAACGAATGATGTTTCGACTGAGTCGTCTAGCGAATCACAAAAGCAGC 


C ft Q 

523 


Spy 5 7 


501 


GTCAACGAATGATGTTTCGACTGAGTTGTCTAGCGAATCACAAAAGCAGC 


pen 

55U 


U09352 


501 


ATCAACGAATGATGTTTCGACTGAGTTGTCTAGCGAATCACAAAAGCAGC 


bbU 


Oklahoma 


501 


GTCAACGAATGATGTTTCGACTGAGTTGTCTAGCGAATCACAAAAGCAGC 


550 






************************* *********************** 




12384 


551 


C AGAAGT AC C AC AAGAAGCAGTTC C AAC TCCTAAAG C AGCTGAAACGAC T 


cnn 
b UU 


2699 


551 


C AG AAGT AC C AC AAGAAGCAGTTC C AACT C CTAAAG CAGCTG AAC CGACT 


cnn 
bUU 


B514 


530 


CAGAAGTACCACAAGAAGCAGTTCCAACTCCTAAAGCAGCTGAAACGACT 


ETC 

5 /i? 


Spy 5 7 


551 


CAGAAGTAC CAC AAG AAG CAGTTCCAACTCCT AAAGC AG CTG AAACG ACT 


cnn 


U09352 


551 


CAGAAGTACCACAAGAAGCAGAAC CAACTCCTAAAG CAGCTGAAAGCACT 


DUU 


• Oklahoma • 


5 51 


CAGAAGTACCACAAGAAGCAGTTCCAACTCCTAAAGCAGCTGAAACGACT 


600 




j 


********************* ********************* *** 




12384 


601 


gaAgtcgaacctaagacagacatctcagaggattcaacttcagctaatag 


650 


2699 


601 


G AAGTCGAAC CTAAGAC AGAC ATCTC AG AAG AC CCAACTTC AGC T AATAG 


c z n 
ball 


B514 


580 


GAAGTCGAACCTAAGACAGACATCTCAGAAGACCCAACTTCAGCTAATAG 




Spy 5 7 


601 


G AAGT CGAAC CTAAGAC AGAC ATC T C AG AAGCCC C AACTTC AG CTAAT AG 


bbU 


U09352 


601 


GAAG T CG AACCTAAG AC AGAC ATCTC AG AAGATTCAACTTCAGCT AATAG 


c c n 


Oklahoma 


601 


GAAG TC GAACCTAAG AC AG AC ATC TC AGAAGC C C C AACTTC AG CTAAT AG 


650 






***************************** * **************** 




12384 


651 


GCCTGTACCTAACGAGAGTGCTTCAGAAGAAGTTTCTTCTGCGGCCCCAG 


/ UU 


2699 


651 


GCCTGTACCTAACGAGAGTGCTTCAGAAGAAGCTTCTTCTGCGGCCCCAG 


700 


B514 


630 


GCCTGTACCTAACGAGAGTGCTTCAGAAGAAGTTTCTTCTGCGGCCCCAG 


CH Q 


Spy 5 7 


651 


G C CT GTAC CTAACGAG AGTG C TTCAGAAGAAGTTT C T TCTGCGG C C C CAG 


*7 n n 


U09352 


651 


GCCTGTACCTAACGGAAGTGCTTCAGAAGAAGCTTCTTCTGCGGCCCCAG 


*7 A o 
/ UU 


Oklahoma 


651 


G C CTGT AC C T AAC G AGAGTGCTTC AGAAGAAGTTTC TTCTGCGG C C C CAG 


700 






************** **************** ***************** 




12384 


701 


CACAAGCCCCAGCAGAAAAAGAAGAAACCTCT GCGCCA 


738 


2699 


701 


CACAAGCTCCAGCAGAAAAAGAAGAAACCTCTCAGATGTTAACTGCGCCA 


750 


B514 


680 


C ACAAG CC C CAG CAGAAAAAGAAG AAAC CT CT GCGCCA 


717 


Spy 5 7 


701 


C ACAAGCCC CAGCAGAAAAAGAAGAAACCTCT GCGCCA 


738 


U09352 


701 


C ACAAG CTC C AGC AG AAAAAG AAG AAAC CTC T CAGATGTTAACTG CGC CA 


750 


Oklahoma 


701 


CACAAGCCCCAGC AGAAAAAGAAGAAACCTCT GCGCCA 


738 






******* ************************ ****** 
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±Z J tt 4 


739 


G v« AGC ACAAAAAGL. 1 G I AvjL. I bALALLALAAb 1 G X. L bbAALL 1 t-AttA I GG 


TOO 


O Q Q 

2699 


751 


G CAGCACAAAAAGCTGTAGCTGACACCACAAGTGTTGGAACC TCAAACGG 


800 


B514 


718 


G C AGC AC AAAAAG CTGTAG CT GAC ACC ACAAG TGTTGC AAC C T C AAAC GG 


767 


Spy 5 7 


739 


G C AG CACAAAAAG CTGTAG C TGAC ACC ACAAG TGTTG CAACC T CAAATGG 


788 


U09352 


751 


GCAGCACAAAAAGCTGTAGCTGACACCACAAG TGTTGC AACCTCAAACGG 


800 


Oklahoma 


739 


GCAGCACAAAAAGCTGTAGCTGACACCACAAGTGTTGCAACCTCAAATGG 


788 






*********************************************** * * 




12384 


789 


C C TTTCTT AC G C T C C AAAC CATGC C TAC AATC CAATG AATGC AGGG CTTC 


83 8 


2699 


801 


CCTTTCTTACGCTCCAAACC ATGCCTACAA 1 C LAA TbAA X GLAGGGLTTC 


850 


B514 


768 


CCTTTCTTACGCTCCAAACCATGCL1 ACAA I L LAA I GAA I GLAGGGLTTC 


817 


Spy 5 7 


7 89 


/*i /*ti 1 11 1 in i/'ii i ii r vtv /im*""i /~iti tv t» /I/in m/innrmi /™»a 7\ mn<"iA ti m/i t\ a m/i/rn nrto/^mmn 

CCTTTCTTACGCTCCAAACCATGCCTACAATCCAATGAATGCAGGGCTTC 


838 


U09352 


801 


CCTTTCTTACGCTCCAAACCATGCCTACAATCCAATGAATGCAGGGCTTC 


850 


Oklahoma 


789 


CCTTTCTTACGCTCCAAACCATGCCTACAATCCAATGAATGCAGGGCTTC 


838 






£*tV************************x**£&^****^******&***** 




123 84 


O "5 O 

83 9 


AACCACAAACAGCAGCCTTCAAAGAAGAAGTGGLTl L IGLL I 1 J. GGTATT 


Q O Q 

8 8 8 


2699 


851 


AAC C AC AAAC AGC AG CCTTCAAAGAAG AAGTG GCTTCTGCCTTTGGT ATT 


900 


B514 


818 


a ~n /"t/" > i ti otv a a a a /"i r"irnm/-» a AA/^AAnAA rirn/mni i » i i/~*rn/-< rTimmTTTTn t\ mm 

AACCACAAACAGCAGCLTTLAAAGAAGAAG IGGL I I L IGLL I I IGGTATT 


867 


Spy 5 7 


839 


Tv tv t\ #**t iv tv TV ^i TV /*i /"t tv /*i /i /-i, iij ii/i tv 7V Tv /*1 tv Tv f~% ta 7V rtrn/i /i /-t m - m nm/^ /"i rn T\ mm 

AAC C AC AAAC AGC AG CCTTCAAAGAAG AAGTG GCTTCTGCCTTTGGT ATT 


388 


U09352 


851 


AACC AC AAACAGCAGCCTTCAAAGAAG AAGTG - CTTCTGCCTTTGGTATT 


899 


Oklahoma 


839 


AACCACAAACAGCAGCCTTCAAAGAAGAAGTGGCTTCTGCCTTTGGTATT 


888 






******************************** ***************** 




12384 


889 


ACGTCATTTAGTGGTTACCGTCCAGGTGATCCAGGAGATCAT - GGTAAAG 


937 


2699 


901 


ACGTCATTTAGTGGTTACCGTCCAGGAGATCCAGGAGATCAT - GGTAAAG 


949 


B514 


868 


ACGTCATTTAGTGGTTACCGTCCAGGTGACCCAGGAGATCAT - GGTAAAG 


916 


Spy5 7 


889 


ACGTCATTTAGTGGTTACCGTCCAGGTGATCCAGGAGATCAT - GGTAAAG 


937 


U09352 


900 


ACGTCATTTAGTGGTTACCGTCCAGGAGATCCAGGAGATCATTGGTAAAG 


949 


Oklahoma 


889 


ACGTCATTTAGTGGTTACCGTCCAGGTGATCCAGGAGATCAT - GGTAAAG 


937 






•kft&'k'ie'ftje'k'k'k'kJr'tftc'k'tc'kikjr'k'k'ic'ttie'le'k "k -k •kit-k'k'k'k-k^t-ie'it-ff-it tV 7V 7V * Vt 7V 




12384 


938 


GTTTGGCCATTGATTTTATGGTGCCTGAAAATTCTGCTCTTGGTGATCAA 


987 


2699 


950 


GATTAGCCATTGACTTTATGGTACCGGTTAGCTCTACGCTTGGTGATCAA 


999 


B514 


917 


GTTTGGCCATTGATTTTATGGTGCCTGAAAATTCTGCTCTTGGTGATCAA 


966 


Spy57 


938 


GTTTGGCCATTGATTTTATGGTGCCTGAAAATTCTGCTCTTGGTGATCAA 


987 


U09352 


950 


• G ATT AGCC ATTGAC TTTATGGTAC CG GTTAG C TG T AGGCTTGGTGATC AA 


999 


Oklahoma 


938 


GTTTGGCCATTGATTTTATGGTGCCTGAAAATTCTGCTCTTGGTGATCAA 


987 






4. -i- -i- 1^4.4,4,44.1. 4.4.4.4,^1,4.4.4. 4-4. 4. 4. 4.4.4. 4. ^U4>4>4.^k.^4.4.4.4.4.4, 




123 84 


o o o 

9 88 


GTTG CTCAATATGCC ATTGAC CAlAlGGCAGAGLGrGGlAI 1 1 CAiACGT 


1037 


2699 


1000 


G TTG CTC AATA J. G CC ATTGAC CAT ATGG C AGAGCGTGGTATT T C ATACGT 


1049 


B514 


967 


GTTG CTC AATATGCC ATTGAC CAT ATGG CAGAGCGTGGTATT TC AT ACGT 


1016 


Spy 5 7 


988 


GTTG CTC AATATG CC ATTGAC CAT ATGG CAGAGCGTGGTATT T C ATACGT 


1037 


U09352 


1000 


GTTG CTC AAT AT G CC ATTGAC CAT ATGG CAGACG GTGGTATTT CAT ACGT 


1049 


Oklahoma 


988 


G TTG CTCAATATGCCATTGACCATATGG CAGAGCGTGGTATT T CAT ACGT 


1037 






*7V**7f**7f**7> T/TCTV****************** TTTfTV******* ****** 




12384 


1038 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1087 


2699 


1050 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1099 


B514 


1017 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1066 


Spy 5 7 


1038 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1087 


U09352 


1050 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1099 


Oklahoma 


1038 


TATTTGGAAACAGCGATTCTATGCGCCATTTGCAAGTATTTACGGACCAG 


1087 






************************************************** 
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12384 1088 C C T AC AC ATGG AAC C C C ATGC C AG AT CGCGGC AGTATTAC AG AA A AC CAT 1137 

2699 1100 CCTAC ACATGG AAC CC C ATG C C AGATCG C GGC AGTATTAC AG AAAAC CAT 1149 

B514 1067 CTTACACATGGAACCCCATGCCAGATCGCGGCAGTATTACAGAAAACCAT 1116 

Spy 5 7 1088 C C T AC ACATGGAAC CC C ATGC CAGATCGCGGC AGTATTAC AG AAAAC CAT 1137 

U09352 1100 C C TAC ACATGG AACCC C ATGC C AG AT CG CGGC AGTATTACAGTTT TC CAT 1149. 

Oklahoma 1088 C C TAC ACATGGAAC CC C ATGC CAG AT CG CGGC AGT ATT ACAG AAAAC CAT 1137 
* **************************************** **** 



12384 1138 T ATG AT CATGTT C ATGT C TC CTTT AATGC T TAA 1170 

2699 1150 TATGAT CATGTT C ATGT C TC CTTT AATGCT TAA 1182 

B514 1117 TATGATCATGTTCATGTCTCCTTTAATGCTTAA 1149 

Spy 5 7 113 8 TATGATCATGTT C ATGT CTCCTTT AATGC TTAA 1170 

U09352 1150 TATGATCATGTT CATGTCTCCTTTAATGC TTAA 1182 

Oklahoma 113 8 TATGATCATGTTCATGTCTCCTTTAATGCTTAA 117 0 
********************************* 
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Figure 18 



12384 1 Mil TKKS LFVTSVALS LAPLATAQAQEWTPRSVTE I KS ELVLVDNVFT YT 50 

2699 1 MIITKKSLFVTSVALSLAPtiATAQAQEWTPRSVTEIKSELVLVDNVFTYI 50 

B514 1 MIITKKSLFVTSVALSLAPLATAQAQEWTPRSVTEIKSELVLVDNVFTYT 50 

Spy57 1 M 1 1 T KKS LF VT S VAL S L VP L AT AQ AQ E WT PR S VTE I KS E L VLVDNVFT YT 50 

U09352 1 Mil TKKS LFVT S VAL S L APLAT AQ AQ E WT P R S VTQ I KS E L VLVDNVFT YT 50 

Oklahoma 1 Mil TKKS L F VT S VAL S L VP LATAQ AQ E WT P RS VTE I KS E L VL VDNVF T YT 50 
***************** **************** t ************** 

12384 51 VKYGDTLSTIAEAMGIDVHVLGDINHrANIDLIFPDTILTANYNQHGQAT 100 

2699 51 VKYGDTLSTIAEAMGIDVHVLGDINHIANIDLIFPDTILTAWYNQHGQAT 100 

B514 51 VKYGDTLSTIAEAMGIDVHVLGDINHIANIDLIFPDTILTANYNQHGQAT 100 

Spy5'7 51 VKYGDTLSTIAEAMGIDVHVLGDINHIANIDLIFPDTILTANYNQHGQAT 100 

U09352 51 VKYGDTLSTIAEAMGIDVHVLGDINHIANIDLIFPDTILTANYNQHGQAT 100 

Oklahoma 51 VKYGDTLSTIAEAMGIDVHVLGDINHIANIDLIFPDTILTANYNQHGQAT 100 
************************************************** 

12384 • 101 TLTVQAPASSPASVSHVPSSEPLPQASATSQSTVPMAPSATPSDVPTTPF 150 

2699 101 TLTVQAPASSPSSVSHVPSSEPLPQASATSQPTVPMAPSATPSDVPTTPF 150 

B514 101 TLTVQAPASSPASVSHVPSSEPLPQASATSQPTVPMAPSATP L 143 

Spy57 101 NLTVQAPASSPASVSHVPSSEPLPQASATSQPTVPMAPPATPSDVPTTPF 150 

U09352 101 TLTVQAPASSPASVSHVPSSEPLPQASATSQSTIPMAPSATPSDVPTTPL 150 

Oklahoma 101 NLTVQAPASSPASVSHVPSSEPLPQASATSQPTVPMAPPATPSDVPTTPF 150 
m ********** m ******************* * **** *** 

12384 151 ASAKPDSSVTASSELTSSTNDVSTELSSESQKQPEVPQEAVPTPKAAETT 200 

2699 151 ASAKPDSSVTASSELTSSTNDVSTELSSESQKQPEVPQEAVPTPKAAEPT 200 

B514 144 ASAKPDSSVTASSELTSSTNDVSTESSSESQKQPEVPQEAVPTPKAAETT 193 

Spy57 151 AS AKPD S S VTASS E LTS S TND VSTE L S SE SQKQ PEVPQE AVPTPKAAETT 200 

U093S2 151 ASAKPDSFVTASSELTSSTNDVSTELSSESQKQPEVPQEAEPTPKAAEST 200 

Oklahoma 151 ASAKPDSSVTASSELTSSTNDVSTELS5ESQKQPEVPQEAVPTPKAAETT 200 
******* ***************** ************** ******* * 

12384 201 EVE PKTD I S ED ST S ANRP VPNE S AS E E VS S AAP AQAP AEKE ETSAP 246 

2699 201 EVEPKTDISEDPTSANRPVPNESASEEASSAAPAQAPAEKEETSQMLTAP 250 

B514 194 EVEPKTDISEDPTSANRPVPNESASEEVSSAAPAQAPAEKE ETSAP 239 

Spy57 • 201- EVEPKTDISEAPTSANRPVPNESASEEVSSAAPAQAPAEKE- ---ETSAP '246 • 

U09352 201 EVEPKTDISEDSTSANRPVPNGSASEEASSAAPAQAPAEKEETSQMLTAP 250 

Oklahoma 201 EVE P KTD I S EAPTS ANR P VPNE S AS E E VS S AAP AQAP AE KE ETSAP 246 

************************************* .** 

12384 247 AAQKAVADTTSVATSNGLSYAPNHAYNPMNAGLQPQTAAFKEEVASAFGI ' 296 

2699 251 AAQKAVADTTSVATSMGLSYAPNHAYNPMNAGLQPQTAAFKEEVASAFGI 300 

B514 240 AAQKAVADTTSVATSNGLSYAPIJHAYNPMNAGLQPQTAAFKEEVASAFGI 289 

Spy57 247 AAQKAVADTTS VAT SNGL S YAPNHAYNPMNAGLQ PQTAAFKE EVAS AFG I 296 

U09352 251 AAQKAVADTTSVATSMGLSYA^NHAYNPMNAGLQPQTAAFKEEVLLPLVL 300 

Oklahoma 247 AAQKAVADTTSVATSNGLSYAPNHAYNPMNAGLQPQTAAFKEEVASAFGI 296 
******************************************** 

12384 297 TS FSGYRPGDPGDHGKGLAIDFMVPENS ALGDQVAQYAIDHMAERG I S YV 346 

2 699 301 TS FSG YRPGDPGDHGKGLAIDFMVPVS STLGDQVAQYAIDHMAERG I S YV 35 0 

B514 290 TS FSGYRPGDPGDHGKGLAIDFMVPENS ALGDQVAQYAIDHMAERG I S YV 339 

Spy57 297 TSFSGYRPGDPGDHGKGLAIDFMVPENSALGDQVAQYAIDHMAERGISYV 346 

U09352 301 RHLWTVQE IQEI I GKGLAIDFMVPVS STLGDQVAQYAIDHMADGG I S YV 350 

Oklahoma 297 TS FSGYRPGDPGDHGKGLAID FMVPENS ALGDQVAQYAIDHMAERG I SYV 34 6 

*********** * ************** ***** 
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12384 347 IWKQRF YAP FAS IYGPAYTWNPMPDRGS ITENHYDHVHVSFNA 389 

2699 351 IWKQRF YAP FAS IYGPAYTWNPMPDRGS ITENHYDHVHVSFNA 393 

B514 34 0 IWKQRF YAP FAS IYGPAYTWNPMPDRGS ITENHYDHVHVSFNA 382 

Spy57 347 IVJKQRFYAPFAS IYGPAYTWNPMPDRGS ITENHYDHVHVSFNA 389 

U09352 351 I WKQRFYAPFAS IYGPAYTWNPMPDRGS ITVFHYDHVHVSFNA 393 

Oklahoma 347 I WKQRFYAPF AS IYGPAYTWNPMPDRGS ITENHYDHVHVSFNA 3 89 
****************************** *********** 
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Figure 19 

1 ATGAAGAAAA GAATGTTATT AGCGTCAACA GTAGCCTTGT CATTTGCCCC 
51 AGTATTGGCA ACTCAAGCAG AAGAAGTTCT TTGGACTGCA CGTAGTGTTG 
101 AGCAAATCCA AAACGATTTG ACTAAAACGG ACAACAAAAC AAGTTATACC 
151 GTACAGTATG GTGATACTTT GAGCACCATT GCAGAAGCCT TGGGTGTAGA 
2 01 TGTCACAGTG CTTGCGAATC TGAACAAAAT CACTAATATG GACTTGATTT 

2 51 TCCCAGAAAC TGTTTTGACA ACGACTGTCA ATGAAG C AG A AGAAGTAACA 

3 01 GAAGTTGAAA TCCAAACACC TCAAGCAGAC TCTAGTGAAG AAGTGACAAC 

3 51 TGCGACAGCA GATTTGACCA CTAATCAAGT GACCGTTGAT GATCAAACTG 

4 01 TTCAGGTTGC AGACCTTTCT CAACCAATTG CAGAAGTTAC AAAGACAGTG 
451 ATTGCTTCTG AAGAAGTGGC ACCATCTACG GGCACTTCTG TCCCAGAGGA 

5 01 GCAAACGACC GAAACAACTC GCCCAGTTGA AGAAGCAACT CCTCAGGAAA 
551 CGACTCCAGC TGAGAAGCAG GAAACACAAG CAAGCCCTCA AGCTGCATCA 
601 GCAGTGGAAG TAACTACAAC AAGTTCAGAA GCAAAAGAAG TAGCATCATC 
651 AAATGGAGCT ACAGCAGCAG TTTCTACTTA TCAACCAGAA GAGACGAAAA 
7 01 TAATTTCAAC AACTTACGAG GCTCCAGCTG CGCCCGATTA TGCTGGACTT 

7 51 GCAGTAGCAA AATCT GAAAA TGCAGGTCTT CAACCACAAA CAGCTGCCTT 

8 01 TAAAGAAGAA ATTGCTAACT TGTTTGGCAT TACATCCTTT AGTGGTTATC 

8 51 GTCCAGGAGA CAGTGGAGAT • CACGGAAAAG GTTTGGCTAT CGACTTTATG 

9 01 GTACCAGAAC GTTCAGAATT AGGGGATAAG ATTGCGGAAT ATGCTATTCA 
951 AAATATGGCC AGCCGTGGCA TTAGTTACAT CATCTGGAAA CAACGTTTCT 

10 01 ATGCTCCATT CGATAGCAAA TATGGGCCAG CTAACACTTG GAACCCAATG 
1051 CCAGACCGTG GTAGTGTGAC AGAAAATCAC TATGATCACG TTCACGTTTC 
1101 AATGAATGGA TAA (SEQ ID NO: 17) 



Figure 20 

1 MKKRMLLAST VALSFAPVLA TQAEEVLWTA RSVEQIQNDL TKTDNKTSYT 
51 VQYGDTLSTI AEALGVDVTV LANLNKITNM DLIFPETVLT TTVNEAEEVT 
101 EVEIQTPQAD SSEEVTTATA DLTTNQVTVD DQTVQVADLS QPIAEVTKTV 
151 IASEEVAPST GTSVPEEQTT ETTRPVEEAT PQETTPAEKQ ETQASPQAAS 
2 01 AVEVTTTSSE AKEVASSMGA TAAVSTYQPE ETKIISTTYE APAAPDYAGL 

2 51 AVAKSEJTAGL QPQTAAFKEE IANLFGITSF SGYRPGDSGD HGKGLAIDFM 

3 01 VPERSELGDK IAEYAIQNMA SRGISYIIWK QRFYAPFDSK YGPANTWNPM 
351 PDRGSV'TENH YDHVHVSMNG * (SEQ ID NO: 18) - 



15/15 



